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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4188 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
ATGCATCCCA TGCATCCCGA AAACCACGCC ATCGCCCGGA 
CCATCTCGCA GTCGGAGCAG CAGGATGTGG CTCCTGCCCG 
GTGGCCAGCA ATGGCCTGCC AGCAGTCAGA GGCCAGTACC 
CATCCCACGG ATCTGGTCGT TAAGAAGAAT GAACCCGCCA 
GGCAAGCCGG AACCCACCAT TGAGTGGTTT AAGGATGGCG 
AAGAAATCGC ACCGCGTCCA GTTCAAGGAC GGCGCCCTCT 
GGCAAGAAGG AGCAGGACGG CGGAGAGTAC TGGTGCGTGG 
GCCGTTAGTC GCCATGCCTC CCTCCAGATA GCTGTTTTGC 
CCCAAAGACA CGCGAGTGGC CAAAGGCGAG ACGGCTCTGC 
GGCATTCCAG AGCCAACGCT GATTTGGATA AAGGACGGCG 
GCCATGTCGT TTGGCGCCAG CTCCCGCGTT CGAATTGTGG 
AGCAATGTGG AGCCCATTGA TGAGGGCAAC TACAAGTGCA 
ACCCGCGAGA GCAGCTATGC CAAGCTGATT GTCCAGGTCA 


GCACGAGCAC 

CACTAATAAC 

60 

CCTGGCTGCT 

CCTCGTCCTG 

120 

AATCGCCACG 

TATCATCGAG 

180 

CGCTCAACTG 

CAAAGTGGAG 

240 

AACCCGTCAG 

CACCAACGAA 

300 

TCTTTTACAG 

GACAATGCAA 

360 

CCAAGAACCG 

AGTGGGCCAG 

420 

GCGACGATTT 

TCGCGTGGAG 

480 

TGGAGTGTGG 

GCCGCCCAAA 

540 

TTCCCTTGGA 

CGACCTGAAA 

600 

ACGGTGGCAA 

CCTGCTGATC 

660 

TTGCCCAGAA 

TCTGGTAGGC 

720 

AACCATACTT 

TATGAAGGAG 

780 


CCCAAGGATC AGGTGATGCT CTACGGCCAG ACAGCCACTT TCCACTGCTC AGTGGGCGGT 840 

GATCCGCCGC CGAAAGTGTT GTGGAAAAAG GAGGAGGGCA ATATTCCGGT GTCCAGAGCG 900 

CGAATCCTTC ACGACGAGAA AAGTTTAGAG ATATCCAACA TAACGCCCAC CGATGAGGGC 960 

ACCTATGTCT GCGAGGCACA CAACAATGTC GGTCAGATCA GCGCTAGGGC TTCTCTTATA 1020 

GTCCACGCTC CGCCGAACTT TACGAAAAGA CCCAGTAACA AGAAAGTGGG ACTAAATGGG 1080 

GTTGTCCAAC TACCTTGCAT GGCCTCCGGA AACCCTCCGC CGTCTGTATT CTGGACCAAG 1140. 

GAAGGAGTAT CCACTCTTAT GTTCCCAAAT AGTTCGCACG GAAGGCAGTA TGTGGCTGCC 1200 

GATGGAACTC TGCAGATTAC GGATGTGCGG CAGGAAGACG AAGGCTACTA TGTGTGTTCC 1260 

GCTTTCAGTG TAGTCGATTC CTCTACAGTA CGGGTTTTCC TGCAAGTCAG CTCGGTAGAC 1320 

GAGCGTCCAC CTCCGATTAT TCAAATCGGA CCTGCCAATC AAACACTGCC CAAGGGATCA 1380 

GTTGCTACTT TACCCTGTCG GGCCACTGGA AATCCCAGTC CCCGTATCAA GTGGTTCCAC 1440 

GATGGACATG CCGTACAAGC GGGCAATCGA TACAGCATCA TCCAAGGAAG CTCACTGAGA 1500 

GTCGATGACC TTCAACTAAG TGACTCTGGT ACCTACACCT GCACTGCATC TGGCGAACGA 1560 

GGAGAAACTT CCTGGGCTGC CACACTAACG GTGGAAAAAC CCGGTTCTAC ATCTCTTCAC 1620 

CGGGCAGCTG ATCCTAGCAC TTATCCTGCT CCTCCAGGAA CACCTAAAGT CCTGAATGTC 1680 

AGTCGCACCA GCATTAGTCT TCGTTGGGCT AAAAGCCAAG AGAAACCCGG AGCTGTGGGC 1740 

CCAATCATTG GATACACTGT AGAGTACTTC AGTCCGGATC TGCAAACTGG TTGGATTGTG 1800 

GCTGCCCATC GAGTCGGCGA CACTCAAGTC ACTATCTCGG GTCTCACTCC TGGCACTTCG 1860 

TATGTGTTCC TAGTTAGAGC TGAGAATACT CAGGGTATTT CTGTGCCTTC CGGCTTATCA 1920 

AATGTTATTA AAACCATTGA GGCAGATTTC GATGCAGCTT CTGCCAATGA TTTGTCAGCA 1980 

GCTCGAACTT TGCTGACAGG AAAGTCGGTG GAGCTAATAG ATGCCTCGGC TATCAATGCT 2040 

AGTGCCGTTA GACTTGAGTG GATGCTCCAC GTGAGCGCTG ATGAGAAATA CGTAGAGGGC 2100 

CTGCGCATAC ACTATAAGGA TGCCAGTGTA CCATCCGCAC AGTATCACTC GATCACTGTT 2160 

ATGGATGCCT CTGCAGAATC GTTTGTGGTG GGAAACCTTA AGAAGTACAC CAAGTATGAG 2220 

TTCTTCCTAA CACCCTTTTT TGAGACAATT GAAGGACAGC CCAGTAACTC CAAGACAGCC 22 80 

CTCACCTATG AAGATGTTCC CTCCGCACCA CCGGATAACA TTCAGATTGG CATGTACAAC 2340 

CAAACAGCCG GTTGGGTGCG TTGGACTCCG CCACCCTCCC AGCACCACAA TGGCAATTTG 2400 

TATGGCTACA AGATTGAGGT CAGCGCCGGT AACACCATGA AGGTGCTGGC CAATATGACT 2460 

CTTAATGCTA CCACCACATC TGTGCTCCTA AATAACCTAA CCACCGGAGC TGTGTACAGC 252 0 

GTGAGGTTGA ACTCCTTTAC CAAGGCAGGA GATGGACCTT ACTCCAAACC GATATCACTA 258 0 

TTCATGGACC CCACCCATCA TGTGCATCCG CCACGGGCAC ATCCAAGCGG CACCCATGAT 2640 

GGGCGACATG AGGGACAGGA TCTCACGTAT CATAACAATG GCAACATACC ACCTGGCGAC 2700 

ATTAATCCCA CCACTCATAA AAAGACCACT GACTACCTAT CTGGACCGTG GCTAATGGTG 2760 

CTGGTCTGCA TCGTTCTTCT AGTCCTGGTT ATTTCGGCGG CTATTTCGAT GGTCTACTTC 2820 

AAGCGCAAGC ATCAAATGAC CAAGGAATTG GGTCACTTAA GTGTGGTCAG TGACAACGAA 2880 

ATAACCGCAT TAAATATCAA TAGCAAAGAG AGCCTTTGGA TAGACCATCA TCGTGGATGG 2940 

CGAACTGCCG ATACTGACAA AGACTCAGGA TTAAGCGAAT CGAAGCTACT ATCCCACGTT 3000 

AACAGCAGTC AATCCAACTA CAATAACTCC GATGGAGGAA CCGATTATGC AGAAGTTGAC 3060 

ACCCGTAACC TTACCACCTT CTACAATTGT CGCAAGAGCC CCGATAATCC CACGCCGTAC 3120 

GCCACCACTA TGATCATTGG TACCTCTTCC AGTGAGACCT GCACCAAGAC AACATCTATA 3180 

AGTGCCGATA AGGACTCGGG AACTCATTCG CCCTATTCTG ACGCATTTGC CGGTCAGGTG 3240 

CCAGCGGTTC CTGTTGTCAA ATCCAACTAT CTTCAGTATC CGGTTGAACC GATCAACTGG 3300 

TCAGAGTTTC TACCCCCGCC GCCAGAACAC CCACCTCCGT CTTCTACCTA TGGATACGCA 3360 

CAAGGATCTC CTGAATCTTC GCGGAAGAGC TCCAAAAGCG CAGGTTCCGG CATTTCTACA 342 0 

AATCAAAGCA TTCTGAACGC ATCCATACAC AGCAGCTCCT CGGGCGGCTT TTCAGCTTGG 3480 

GGAGTATCGC CCCAATATGC TGTCGCCTGT CCACCGGAAA ACGTTTATAG CAATCCGCTG 3540 

TCGGCAGTGG CTGGCGGCAC CCAGAACCGC TATCAGATAA CGCCCACAAA CCAACATCCG 3600 

CCACAGTTAC CGGCCTACTT TGCCACCACG GGTCCAGGAG GAGCTGTACC ACCCAACCAC 3660 

CTGCCATTTG CCACACAGCG TCATGCAGCC AGCGAGTACC AGGCTGGACT GAATGCAGCG 3 720 

CGATGTGCCC AAAGCCGCGC CTGCAACAGC TGCGATGCCT TGGCCACACC CTCGCCCATG 3780 

CAACCCCCAC CGCCAGTTCC CGTACCCGAG GGCTGGTACC AACCGGTGCA TCCCAATAGC 384 0 

CACCCGATGC ACCCGACCTC CTCCAACCAC CAGATCTACC AGTGCTCCTC CGAGTGCTCG 3900 

GATCACTCGA GGAGCTCGCA GAGTCACAAG CGGCAGCTGC AGCTCGAGGA GCACGGCAGC 3 960 

AGTGCCAAAC AACGCGGAGG ACACCACCGT CGACGAGCCC CGGTGGTGCA GCCGTGCATG 4 020 


GAGAGCGAGA ACGAGAACAT GCTGGCGGAG TACGAGCAGC GCCAGTACAC CAGCGATTGC 4080 
TGCAATAGCT CCCGCGAGGG CGACACCTGC TCCTGCAGCG AGGGATCCTG TCTTTACGCC 4140 
GAGGCGGGCG AGCCGGCGCC TCGTCAAATG ACTGCTAAGA ACACCTAA 4188 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 


Met 

His 

Pro 

Met 

His 

Pro 

Glu 

Asn 

His 

Ala 

He Ala Arg Ser Thr Ser 

1 




5 





10 

15 

Thr 

Thr 

Asn 

Asn 

Pro 

Ser 

Arg 

Ser 

Arg 

Ser 

Ser Arg Met Trp Leu Leu 




20 





25 


30 

Pro 

Ala 

Trp 

Leu 

Leu 

Leu 

Val 

Leu 

Val 

Ala 

Ser Asn Gly Leu Pro Ala 



35 





40 



45 

Val 

Arg 

Gly 

Gin 

Tyr 

Gin 

Ser 

Pro 

Arg 

He 

He Glu His Pro Thr Asp 


50 





55 




60 

Leu 

Val 

Val 

Lys 

Lys 

Asn 

Glu 

Pro 

Ala 

Thr 

Leu Asn Cys Lys Val Glu 

65 





70 





75 80 

Gly 

Lys 

Pro 

Glu 

Pro 

Thr 

He 

Glu 

Trp 

Phe 

Lys Asp Gly Glu Pro Val 





85 





90 

95 

Ser 

Thr 

Asn 

Glu 

Lys 

Lys 

Ser 

His 

Arg 

Val 

Gin Phe Lys Asp Gly Ala 




100 





105 


110 

Leu 

Phe 

Phe 

Tyr 

Arg 

Thr 

Met 

Gin 

Gly 

Lys 

Lys Glu Gin Asp Gly Gly 



115 





120 



125 

Glu 

Tyr 

Trp 

Cys 

Val 

Ala 

Lys 

Asn 

Arg 

Val 

Gly Gin Ala Val Ser Arg 


130 





135 




140 

His 

Ala 

Ser 

Leu 

Gin 

He 

Ala 

Val 

Leu 

Arg 

Asp Asp Phe Arg Val Glu 

145 





150 





155 160 

Pro 

Lys 

Asp 

Thr 

Arg 

Val 

Ala 

Lys 

Gly 

Glu 

Thr Ala Leu Leu Glu Cys 





165 





170 

175 

Gly 

Pro 

Pro 

Lys 

Gly 

He 

Pro 

Glu 

Pro 

Thr 

Leu He Trp He Lys Asp 




180 





185 


190 

Gly 

Val 

Pro 

Leu 

Asp 

Asp 

Leu 

Lys 

Ala 

Met 

Ser Phe Gly Ala Ser Ser 



195 





200 



205 

Arg 

Val 

Arg 

He 

Val 

Asp 

Gly 

Gly 

Asn 

Leu 

Leu He Ser Asn Val Glu 


210 





215 




220 

Pro 

He 

Asp 

Glu 

Gly 

Asn 

Tyr 

Lys 

Cys 

He 

Ala Gin Asn Leu Val Gly 

225 





230 





235 240 

Thr 

Arg 

Glu 

Ser 

Ser 

Tyr 

Ala 

Lys 

Leu 

He 

Val Gin Val Lys Pro Tyr 





245 





250 

255 

Phe 

Met 

Lys 

Glu 

Pro 

Lys 

Asp 

Gin 

Val 

Met 

Leu Tyr Gly Gin Thr Ala 




260 





265 


270 

Thr 

Phe 

His 

Cys 

Ser 

Val 

Gly 

Gly 

Asp 

Pro 

Pro Pro Lys Val Leu Trp 



275 





280 



285 

Lys 

Lys 

Glu 

Glu 

Gly 

Asn 

He 

Pro 

Val 

Ser 

Arg Ala Arg He Leu His 


290 





295 




300 

Asp 

Glu 

Lys 

Ser 

Leu 

Glu 

He 

Ser 

Asn 

He 

Thr Pro Thr Asp Glu Gly 

305 





310 





315 320 

Thr 

Tyr 

Val 

Cys 

Glu 

Ala 

His 

Asn 

Asn 

Val 

Gly Gin He Ser Ala Arg 





325 





330 

335 


Aia 

oci Leu lie vai 




740 
J *± U 



Asn 

T t rr-i T trc» Val /""2l 

byS JjyS Vdl oJ.y 

JjcU 

rloll VJiy 




360 

Ser 

LiXy Asn fro fro 

Pro 

ocX val 


j / u 



Tnr 

Leu. Met. riic fro 

Asn 

OCl OCX. 

Job 




Asp 

Gly Tnr Leu uin 

lie 

inr Asp 


a n, ci 



Tyr 

val cys ber Ala 

Fne 

Oqv Va "1 


49 0 
ft z u 



Phe 

Leu Gin vai ber 

Ser 

Val Asp 


A 1 C 


440 

lie 

Gly Pro Ala Asn 

r«l t-> 

Gin 

Thr Leu 


450 


4bb 

Pro 

Cys Arg Ala Thr 

/IT,, 

Gly 

Asn Pro 

465 


/ *7 A 
4 /U 


Asp 

Gly His Ala Val 

Gin 

«1 > p 1 t r 

Ala Gly 


a a c 
4ob 



Ser 

Ser Leu Arg Val 

Asp 

Asp Leu 


c a a 

500 



Thr 

Cys Thr Ala Ser 

pi i 

Cjriy 

tjiu Arg 


515 



Leu 

Thr Val Glu Lys 

Pro 

Gly ber 


530 


bob 

Pro 

Ser Thr Tyr Pro 

Ala 

Pro Pro 

545 


CCA 


Ser 

Arg Thr Ser lie 

Ser 

Leu Arg 


c c cr 
bob 



Gly 

Ala Val Gly Pro 

lie 

lie oiy 


C O A 
DOO 



Asp 

Leu Gin Thr Gly 

Trp 

T 1 n T 7— » 1 

lie vai 


595 


bUU 

Gin 

Val Thr lie Ser 

Pi * r 

Gly 

Leu inr 


<r i A 

610 


DXb 

Val 

Arg Ala Glu Asn 

rpl_ „ 

Tnr 

Gin Gly 

625 




Asn 

Val lie Lys Tnr 

lie 

Cjiu Aia 


C A C 

o4b 



Asp 

Leu Ser Ala Ala 

Arg 

inr beu 


bbU 



lie 

Asp Ala Ser Ala 

lie 

Asn Ala 


675 


DOU 

Leu 

His Val Ser Ala 

Asp 

Glu Lys 


A A 

6 90 


C Q c. 

Tyr 

Lys Asp Ala Ser 

• Val 

Pro Ser 

705 


«-» 1 A 
/ 10 


Met 

Asp Ala Ser Ala 

Glu 

Ser Phe 


725 



Thr 

Lys Tyr Glu Phe 

Phe 

Leu Thr 


740 



Gin 

Pro Ser Asn Ser 

Lys 

Thr Ala 


755 


760 


rlO ASn 

flic 

mi uyo iiiy 

Pro 

IT 1 \J 

OCl 

74^ 


350 



\r a l t/al 
Val val 

m n 

T.^ii Drn Cxrc 
Jjcu riu Lyo 


ai a 



365 



fl lc 1 J. |J 

iin 

uyo ux u vjijr 

Val 

Ser 



380 



u-i c -v/ 
nib vjiy 

Arg 

win lyi vai 

Ala 

ril a 

Ala 





400 

val Aiy 


ulU nbp ulU 

vji y 


41 n 
*± 1 V 



41 R 
1 1 j 


Val ASp 

Gov 

Gov TVi >- TTra 1 

oci nil vai 

Arg 

vai 

49R 





vj±u Arg 

Pro 

rlO riU lie 

Tl ^ 
lie 

pi n 
bill 



44 R 



Pro Lys 

uiy 

Ccy i; 3 1 Jl 3 

ber vai Aia 

inr 

Leu 



4^n 



Ser Pro 

Arg 

lie Lys Trp 

pne 

HIS 


*7 Q 

^ / b 



4 p n 
*± o u 

Asn Arg 

Tyr 

oer lie lie 

Pi -r-i 

uin 

p i _ . 
oiy 




fiyb 


Gin Leu 

Ser 

Asp ber Cjiy 

inr 

Tyr 

C A C 

bUb 





m , . pi ,, 

Gly Glu 

Thr 

ber irp Aia 

7Al a 

Aia 

inr 






Thr Ser 

Leu 

his Arg Aia 

7Al a 

Aia 

Asp 



DftU 



(aiy inr 

Pro 

Liys Val Lieu 

Asn 

Val 
Val 


bbb 




Trp Ala 

Lys 

0*^'»»* Pi T - ! P) 1 11 

ber vjin yiu 

Lys 

Pro 

c. *7 A 
D / U 



j / j 


Tyr Thr 

vai 

p lii r-v^ n Vi ^ 

ulU iyr rne 

Ser 

Pro 

bob 


con 



Aia Aia 

TJ -l _ 

rllS 

Arg vai biy 

Asp 

1 11X 



DUD 



Pro Gly 

inr 

ber iyr vai 

rile 

Leu 



OZU 



lie Ser 

vai 

Fro ber Liiy 

Leu 

Ser 





S4H 

o *± u 

Asp Phe 

Asp 

Aia Aia ber 

Ala 
Ala 

Asn 

rcn 

o bu 



O J J 


Leu Thr 

pi , r 
oiy 

ijys ber vai 

ulU 

Leu 

665 


D / u 



Ser Ala 

vai 

Arg Leu Glu 

Trp 

rlct 



/roc 
DOD 



Tyr vai 

pi 

CjIU 

oiy ueu Arg 

lie 

nis 



nnn 
/ UU 



Ala Gin 

Tyr 

His ber lie 

i nr 

Val 


/ID 



790 

Val Val 

Gly 

Asn Leu Lys 

Lys 

Tyr 

730 



735 


Pro Phe 

Phe 

Glu Thr He 

Glu 

Gly 

745 


750 



Leu Thr 

Tyr 

Glu Asp Val 

Pro 

Ser 


765 


Ala 
Ala 

Pro 

XT I. (J 

Asp 

A toil 

Tie* 

x x c 

Gin 


vjx y 

Met 

Tvr 

xy x 

Asn 

Gin 

Thr 

Ala 

Glv 

\JX J 


77D 





775 





780 





T m 

V CL X 

Axg 

Trr> 
xxp 

Thr 

X XXX 

Pro 

Pro 

Pro 

Ser 

Gin 

His 

His 

Asn 

Gly 

Asn 

Leu 

7Q 





790 





795 





800 

iyr 

ui y 

Tvr" 
iyx 

xjy o 

He 

Glu 

Val 

Ser 

Ala 

Glv 

Asn 

Thr 

Met 

Lvs 

Val 

Leu 





805 





810 





815 


Hid 

A C T~l 


Thr 

Leu 

Asn 

Ala 

Thr 

Thr 

Thr 

Ser 

Val 

Leu 

Leu 

Asn 

Asn 




820 





825 





830 




Thr 

X XIX 

Thr 

X XXX 

Glv 
oxy 

Ala 

Val 

Tvr 

Ser 

Val 

Aro 

Leu 

Asn 

Ser 

Phe 

Thr 

Lys 



835 





840 





845 




riX ct 

vjiy 


ox y 

Pro 

Tvr 
xy x 

Ser 

Lys 

Pro 

He 

Ser 

Leu 

Phe 

Met 

ASD 

Pro 


850 





855 





860 





Th y~ 
1 11L 

nio 


Val 

v d x 

His 

Pro 

Pro 

Arg 

Ala 

His 

Pro 

Ser 

Glv 

VJX J 

Thr 

His 

Asp 

OuJ 





870 





875 





880 

ijxy 

Arg 

ni o 

OX IX 

wx y 

m n 

OX XX 

Asp 

Leu 

Thr 

Tvr 

xy x 

His 

Asn 

Asn 

Glv 

Asn 

lie 





O O —J 





890 





895 


Pro 

Pro 

ml \r 

vj±y 


X X c 

Aon 

Pro 

Thr 

Thr 

His 

Lys 

Lys 

Thr 

Thr 

Asp 

Tvr 

xy x 









905 





910 



Leu 

Ser 

vjiy 

Pro 

Trp 

T .01 1 

Met- 
ric L. 

v^l 

val 

T .^1 1 
ucu 

Val 

V CLX 


He 

X xc 

V^l 

V CLX 

Leu 

Leu 

Val 



y Id 










j £1 




Leu 

vai 

lie 

Ser 

Aid 

ax a 

Tip 

lie 

ocx 

Mpj- 
rlCL 

V^l 
val 

xyr 

Phe 

xjy 0 

Axy 

Lys 

His 


y o u 





935 





940 






Met 

Thr 

Lys 

olU 

Leu 

ui y 

n x 0 

XJG u. 

Cpv 
OCX 

Val 

Val 

Ser 

Asp 

Asn 

Glu 






_7 -3 VJ 





955 





960 

lie 

Txir 

Aia 

Leu 

Asn 

lie 

Asn 

Ser 

Lys 

ul U. 

OCX 

T i 

XjCIX 

Trp 

He 

Asp 

His 










970 





975 


nlS 

Arg 

Gly 

Trp 

ax y 

Thr- 
1 xxX 

Ala 

Asp 

Thr 

A CTi 
nop 

Lys 

Asp 

Ser 

Gly 

Leu 

Ser 









985 





990 



Glu 

Ser 

Lys 

Leu 

Leu 

Ser 

His 

Val 

Asn 

OCX 

Ser 

Gin 

Ser 

Asn 

Tvr 

x y x 

Asn 



995 





1000 




1005 



Asn 

Ser 

Asp 

<jiy 

Gly 

Thr 

Asp 

Tyr Ala 

Glu 

Val 

Asp 

Thr 

Arg 

Asn 

Leu 


1010 




1015 




1020 




Thr 

Thr 

Phe 

Tyr 

Asn 

Cys 

Arg 

Lys 

Ser 

Pro 

Asp 

Asn 

Pro 

Thr 

Pro 

Tyr 

1025 




1030 




1035 




104( 

Ala 

Thr 

Thr 

Met 

He 

He Gly Thr 

Ser 

Ser 

Ser 

Glu 

Thr 

Cys 

Thr 

Lys 





1045 




1050 




1055 

Thr 

Thr 

Ser 

He 

Ser Ala Asp Lys Asp 

Ser Gly Thr His 

Ser 

Pro 

Tyr 




1060 




1065 




1070 


Ser Asp 

Ala 

Phe Ala Gly Gin 

Val 

Pro 

Ala 

Val 

Pro 

Val 

Val 

Lys 

Ser 


1075 1080 1085 


Asn Tyr Leu Gin Tyr Pro Val Glu Pro He Asn Trp Ser Glu Phe Leu 

1090 1095 1100 

Pro Pro Pro Pro Glu His Pro Pro Pro Ser Ser Thr Tyr Gly Tyr Ala 
1105 1110 1115 1120 

Gin Gly Ser Pro Glu Ser Ser Arg Lys Ser Ser Lys Ser Ala Gly Ser 

1125 1130 1135 

Gly lie Ser Thr Asn Gin Ser He Leu Asn Ala Ser He His Ser Ser 

1140 1145 1150 

Ser Ser Gly Gly Phe Ser Ala Trp Gly Val Ser Pro Gin Tyr Ala Val 

1155 1160 1165 

Ala Cys Pro Pro Glu Asn Val Tyr Ser Asn Pro Leu Ser Ala Val Ala 

1170 1175 1180 

Gly Gly Thr Gin Asn Arg Tyr Gin He Thr Pro Thr Asn Gin His Pro 
1185 1190 1195 1200 


1 


Pro Gin Leu Pro Ala Tyr Phe Ala Thr Thr Gly Pro Gly Gly Ala Val 

1205 1210 1215 

Pro Pro Asn His Leu Pro Phe Ala Thr Gin Arg His Ala Ala Ser Glu 

1220 1225 1230 

Tyr Gin Ala Gly Leu Asn Ala Ala Arg Cys Ala Gin Ser Arg Ala Cys 

1235 1240 1245 

Asn Ser Cys Asp Ala Leu Ala Thr Pro Ser Pro Met Gin Pro Pro Pro 

1250 1255 1260 

Pro Val Pro Val Pro Glu Gly Trp Tyr Gin Pro Val His Pro Asn Ser 
1265 1270 1275 1280 

His Pro Met His Pro Thr Ser Ser Asn His Gin lie Tyr Gin Cys Ser 

1285 1290 1295 

Ser Glu Cys Ser Asp His Ser Arg Ser Ser Gin Ser His Lys Arg Gin 

1300 1305 1310 

Leu Gin Leu Glu Glu His Gly Ser Ser Ala Lys Gin Arg Gly Gly His 

1315 1320 1325 

His Arg Arg Arg Ala Pro Val Val Gin Pro Cys Met Glu Ser Glu Asn 

1330 1335 1340 

Glu Asn Met Leu Ala Glu Tyr Glu Gin Arg Gin Tyr Thr Ser Asp Cys 
1345 1350 1355 . 1360 

Cys Asn Ser Ser Arg Glu Gly Asp Thr Cys Ser Cys Ser Glu Gly Ser 

1365 1370 1375 

Cys Leu Tyr Ala Glu Ala Gly Glu Pro Ala Pro Arg Gin Met Thr Ala 
1380 1385 1390 

Lys Asn Thr 
1395 


(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGTGAAAATC CACGCATCAT CGAGCATCCC ATGGACACGA CGGTGCCAAA AAATGATCCA 60 

TTTACGTTTA ATTGCCAGGC CGAGGGCAAT CCAACACCAA CCATTCAATG GTTTAAGGAC 12 0 

GGTCGCGAAC TGAAGACGGA TACGGGTTCG CATCGCATAA TGCTGCCCGC CGGGGGTCTA 180 

TTCTTTCTCA AGGTTATCCA CTCACGTAGA GAGAGCGATG CGGGCACTTA CTGGTGCGAG 240 

GCCAAAAACG AGTTTGGAGT GGCACGGTCC AGGAATGCAA CGTTGCAAGT GGCAGTTCTC 300 

CGCGACGAAT TCCGTTTGGA GCCGGCAAAT ACCCGCGTGG CCCAAGGCGA GGTGGCCCTG 360 

ATGGAATGCG GTGCCCCCCG AGGATCTCCG GAGCCGCAAA TCTCGTGGCG CAAGAACGGC 420 

CAGACCCTGA ATCTTGTCGG GAACAAGCGG ATTCGCATTG TCGACGGTGG CAATCTGGCC 480 

ATCCAGGAAG CCCGCCAATC GGACGACGGA CGCTACCAGT GTGTGGTCAA GAATGTGGTT 540 

GGCACCCGGG AGTCGGCCAC CGCTTTTCTT AAAGTGCATG TACGTCCATT CCTCATCCGA 600 

GGACCCCAGA ATCAGACGGC GGTGGTGGGC AGCTCGGTGG TCTTCCAGTG CCGCATCGGA 660 

GGCGATCCCC TGCCTGATGT CCTGTGGCGA CGCACTGCCT CCGGCGGCAA TATGCCACTG 720 

CGTAAGTTTT CTTGGCTTCA TTCAGCTTCA GGTCGTGTGC ACGTACTTGA GGACCGCAGT 780 

CTGAAGCTGG ACGACGTTAC TCTGGAGGAC ATGGGCGAGT ACACTTGCGA GGCGGACAAT 840 

GCGGTGGGCG GCATCACGGC CACTGGCATC CTCACCGTTC ACGCTCCCCC CAAATTTGTG 900 

ATACGCCCCA AGAATCAGCT GGTGGAGATC GGTGATGAAG TGCTGTTCGA GTGCCAAGCG 960 

AATGGACATC CCCGACCAAC GCTCTACTGG TCGGTGGAGG GCAACAGCTC CCTGCTGCTC 102 0 

CCCGGCTATC GGGATGGCCG CATGGAAGTG ACCCTGACGC CCGAGGGGCG CTCGGTGCTC 1080 

TCGATAGCTC GATTTGCCCG TGAGGATTCC GGAAAGGTGG TCACTTGCAA CGCCCTGAAC 114 0 


GCCGTGGGCA GCGTCAGCAG TCGGACTGTG GTCAGTGTGG ATACGCAATT CGAGCTGCCA 1200 

CCGCCGATTA TCGAACAGGG GCCCGTGAAT CAAACGTTGC CCGTTAAATC AATTGTGGTT 1260 

CTGCCATGCC GAACTCTGGG CACTCCAGTG CCACAGGTCT CTTGGTACCT GGATGGCATA 1320 

CCCATCGATG TGCAGGAGCA CGAGCGGCGG AATCTTTCGG ACGCTGGAGC CTTAACCATT 1380 

TCGGATCTTC AGCGCCACGA GGATGAAGGC TTGTACACCT GCGTGGCCAG CAATCGCAAC 1440 

GGAAAATCCT CTTGGAGTGG TTACCTTCGT CTGGACACCC CGACAAATCC GAATATCAAG 1500 

TTCTTCAGAG CCCCAGAACT TTCCACCTAC CCAGGGCCGC CAGGAAAACC GCAAATGGTG 1560 

GAGAAGGGCG AAAATTCGGT GACTCTCAGC TGGACGAGGA GCAACAAGGT GGGCGGCTCC 1620 

AGTCTGGTGG GCTATGTAAT CGAGATGTTT GGCAAAAACG AAACGGATGG CTGGGTGGCT 1680 

GTGGGCACTA GGGTGCAAAA TACCACGTTT ACCCAAACGG GTCTGCTGCC GGGTGTGAAT 1740 

TACTTCTTTC TAATTCGAGC CGAGAACTCC CATGGCTTAT CACTGCCCAG TCCGATGTCG 1800 

GAACCCATTA CGGTGGGAAC GCGCTACTTC AATAGTGGTC TGGATCTGAG CGAGGCTCGT 1860 

GCCAGTCTGC TGTCCGGAGA TGTTGTGGAG CTGAGCAACG CCAGTGTGGT GGACTCCACT 1920 

AGCATGAAAC TCACCTGGCA GATCATCAAT GGCAAATACG TCGAGGGCTT CTATGTCTAT 1980 

GCGAGACAGT TGCCAAATCC AATAGTCAAC AATCCGGCGC CCGTTACTAG CAATACCAAT 2040 

CCGCTGCTGG GCTCTACATC CACATCCGCA TCCGCATCCG CCTCGGCATC GGCATTGATT 2100 

TCGACAAAGC CAAATATTGC AGCTGCCGGC AAACGTGATG GGGAGACAAA CCAGAGTGGA 2160 

GGAGGAGCTC CGACCCCACT GAACACCAAG TATCGCATGC TAACGATTCT CAATGGCGGT 2220 

GGCGCCTCAT CCTGCACCAT CACCGGGCTC GTCCAGTACA CGCTGTATGA ATTTTTCATC 2280 

GTGCCATTTT ACAAATCCGT CGAGGGCAAG CCGTCGAATT CGCGCATCGC TCGCACCCTT 2340 

GAAGATGTTC CCTCTGAGGC ACCATATGGA ATGGAGGCTC TGCTGTTGAA CTCCTCCGCG 2400 

GTCTTCCTCA AATGGAAGGC ACCAGAACTC AAGGATCGGC ATGGTGTTCT CTTGAACTAT 2460 

CATGTTATAG TCCGAGGTAT TGACACTGCC CACAATTTCT CACGCATTTT GACAAATGTC 2520 

ACCATCGATG CCGCTTCGCC TACTCTGGTT TTGGCCAATC TCACCGAAGG CGTCATGTAC 2580 

ACCGTGGGCG TGGCGGCCGG AAATAACGCT GGAGTTGGTC CTTATTGTGT CCCAGCTACT 2640 

TTGCGTTTGG ATCCCATCAC AAAGCGACTC GATCCGTTCA TCAATCAGCG GGACCATGTT 2700 

AACGATGTGC TGACGCAGCC CTGGTTCATA ATACTCCTGG GCGCCATCCT GGCCGTTCTT 2760 

ATGCTGTCCT TTGGCGCAAT GGTCTTTGTG AAGCGCAAGC ACATGATGAT GAAGCAGTCG 2820 

GCCCTAAATA CAATGCGTGG CAATCACACG AGCGACGTGC TCAAAATGCC GAGTCTATCG 2880 

GCGCGCAATG GAAACGGCTA CTGGCTGGAC TCCTCCACCG GCGGAATGGT GTGGCGTCCC 2940 

TCGCCCGGCG GCGACTCGCT GGAGATGCAA AAGGATCACA TCGCCGACTA TGCGCCGGTC 3000 

TGCGGTGCCC CCGGTTCTCC GGCCGGCGGT GGCACCTCTT CCGGTGGATC CGGTGGCGCG 3060 

GGCAGCGGTG CCAGCGGCGG CGATGACATT CATGGAGGAC ACGGCAGCGA ACGCAATCAG 3120 

CAGCGGTACG TGGGCGAGTA CTCCAACATA CCGACCGACT ATGCAGAGGT GTCCAGTTTT 3180 

GGCAAGGCAC CCAGCGAGTA TGGTCGGCAT GGCAACGCCT CCCCGGCCCC TTATGCCACC 3240 

TCTTCGATCC TGAGTCCCCA CCAGCAGCAA CAGCAGCAGC AGCCGCGTTA TCAACAGCGA 3300 

CCAGTGCCCG GCTATGGGCT CCAGCGCCCA ATGCACCCAC ACTACCAGCA GCAGCAGCAT 3360 

CAGCAGCAAC AGGCGCAGCA GACGCACCAG CAACACCAGG CTCTCCAGCA GCACCAGCAA 3420 

CTGCCACCCA GCAACATCTA CCAGCAGATG TCCACCACCA GCGAGATATA CCCCACGAAC 3480 

ACGGGTCCTT CGCGCTCTGT CTACTCTGAG CAGTATTACT ACCCCAAGGA CAAGCAGAGA 3540 

CACATCCACA TCACCGAGAA CAAGCTGAGC AACTGCCACA CCTATGAGGC GGCTCCTGGC 3600 

GCCAAGCAGT CCTCGCCGAT ATCCTCGCAG TTCGCCAGCG TGAGGCGGCA GCAGCTGCCG 3660 

CCCAACTGCA GCATCGGCAG GGAAAGTGCC CGCTTCAAGG TGCTAAACAC GGATCAGGGC 3720 

AAGAACCAGC AGAATCTCCT GGATCTCGAC GGCTCCTCGA TGTGCTACAA CGGTCTGGCA 3780 

GACTCGGGCT GCGGTGGATC TCCCTCCCCG ATGGC CATGC TGATGTCGCA CGAGGACGAG 3840 

CACGCGCTGT ACCACACGGC GGATGGGGAT CTGGACGACA TGGAACGACT GTACGTCAAG 3900 

GTGGACGAGC AGCAGCCTCC ACAGCAGCAG CAGCAGCTGA TTCCCCTGGT CCCACAGCAT 3 960 

CCGGCGGAAG GTCACCTGCA GTCCTGGCGG AATCAGAGCA CGCGGAGCAG TCGGAAGAAC 4020 

GGCCAGGAAT GCATCAAGGA ACCCAGCGAG TTGATCTACG CTCCGGGAAG CGTGGCCAGC 4080 

GAACGGAGCC TCCTCAGCAA CTCGGGTAGC GGCACCAGCA GCCAGCCAGC TGGCCACAAT 4140 

GTCTGA 4146 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 


i 


(A) LENGTH: 1381 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Gly Glu Asn Pro Arg lie lie Glu His Pro Met Asp Thr Thr Val Pro 

1 5 10 15 

Lys Asn Asp Pro Phe Thr Phe Asn Cys Gin Ala Glu Gly Asn Pro Thr 

20 25 30 

Pro Thr lie Gin Trp Phe Lys Asp Gly Arg Glu Leu Lys Thr Asp Thr 

35 40 45 

Gly Ser His Arg lie Met Leu Pro Ala Gly Gly Leu Phe Phe Leu Lys 

50 55 60 

Val lie His Ser Arg Arg Glu Ser Asp Ala Gly Thr Tyr Trp Cys Glu 
65 70 75 80 

Ala Lys Asn Glu Phe Gly Val Ala Arg Ser Arg Asn Ala Thr Leu Gin 

85 90 95 

Val Ala Val Leu Arg Asp Glu Phe Arg Leu Glu Pro Ala Asn Thr Arg 

100 105 110 

Val Ala Gin Gly Glu Val Ala Leu Met Glu Cys Gly Ala Pro Arg Gly 

115 120 125 

Ser Pro Glu Pro Gin lie Ser Trp Arg Lys Asn Gly Gin Thr Leu Asn 

130 135 140 

Leu Val Gly Asn Lys Arg lie Arg lie Val Asp Gly Gly Asn Leu Ala 
145 150 155 160 

lie Gin Glu Ala Arg Gin Ser Asp Asp Gly Arg Tyr Gin Cys Val Val 

165 170 175 

Lys Asn Val Val Gly Thr Arg Glu Ser Ala Thr Ala Phe Leu Lys Val 

180 185 190 

His Val Arg Pro Phe Leu lie Arg Gly Pro Gin Asn Gin Thr Ala Val 

195 200 205 

Val Gly Ser Ser Val Val Phe Gin Cys Arg lie Gly Gly Asp Pro Leu 

210 215 220 

Pro Asp Val Leu Trp Arg Arg Thr Ala Ser Gly Gly Asn Met Pro Leu 
225 230 235 240 

Arg Lys Phe Ser Trp Leu His Ser Ala Ser Gly Arg Val His Val Leu 

245 250 255 

Glu Asp Arg Ser Leu Lys Leu Asp Asp Val Thr Leu Glu Asp Met Gly 

260 265 270 

Glu Tyr Thr Cys Glu Ala Asp Asn Ala Val Gly Gly lie Thr Ala Thr 

275 280 285 

Gly lie Leu Thr Val His Ala Pro Pro Lys Phe Val lie Arg Pro Lys 

290 295 300 

Asn Gin Leu Val Glu lie Gly Asp Glu Val Leu Phe Glu Cys Gin Ala 
305 310 315 320 

Asn Gly His Pro Arg Pro Thr Leu Tyr Trp Ser Val Glu Gly Asn Ser 

325 330 335 

Ser Leu Leu Leu Pro Gly Tyr Arg Asp Gly Arg Met Glu Val Thr Leu 

340 345 350 

Thr Pro Glu Gly Arg Ser Val Leu Ser lie Ala Arg Phe Ala Arg Glu 

355 360 365 

Asp Ser Gly Lys Val Val Thr Cys Asn Ala Leu Asn Ala Val Gly Ser 
370 375 380 


Val Ser Ser Arg Thr Val Val Ser Val Asp. Thr Gin Phe Glu Leu Pro 
385 390 395 400 

Pro Pro lie lie Glu Gin Gly Pro Val Asn Gin Thr Leu Pro Val Lys 

405 410 415 

Ser lie Val Val Leu Pro Cys Arg Thr Leu Gly Thr Pro Val Pro Gin 

420 425 430 

Val Ser Trp Tyr Leu Asp Gly lie Pro lie Asp Val Gin Glu His Glu 

435 440 445 

Arg Arg Asn Leu Ser Asp Ala Gly Ala Leu Thr lie Ser Asp Leu Gin 

450 455 460 

Arg His Glu Asp Glu Gly Leu Tyr Thr Cys Val Ala Ser Asn Arg Asn 
465 470 475 480 

Gly Lys Ser Ser Trp Ser Gly Tyr Leu Arg Leu Asp Thr Pro Thr Asn 

485 490 495 

Pro Asn lie Lys Phe Phe Arg Ala Pro Glu Leu Ser Thr Tyr Pro Gly 

500 505 510 

Pro Pro Gly Lys Pro Gin Met Val Glu Lys Gly Glu Asn Ser Val Thr 

515 520 525 

Leu Ser Trp Thr Arg Ser Asn Lys Val Gly Gly Ser Ser Leu Val Gly 

530 535 540 

Tyr Val lie Glu Met Phe Gly Lys Asn Glu Thr Asp Gly Trp Val Ala 
545 550 555 560 

Val Gly Thr Arg Val Gin Asn Thr Thr Phe Thr Gin Thr Gly Leu Leu 

565 570 575 

Pro Gly Val Asn Tyr Phe Phe Leu He Arg Ala Glu Asn Ser His Gly 

580 585 590 

Leu Ser Leu Pro Ser Pro Met Ser Glu Pro He Thr Val Gly Thr Arg 

595 600 605 

Tyr Phe Asn Ser Gly Leu Asp Leu Ser Glu Ala Arg Ala Ser Leu Leu 

610 615 620 

Ser Gly Asp Val Val Glu Leu Ser Asn Ala Ser Val Val Asp Ser Thr 
625 630 635 640 

Ser Met Lys Leu Thr Trp Gin He He Asn Gly Lys Tyr Val Glu Gly 

645 650 655 

Phe Tyr Val Tyr Ala Arg Gin Leu Pro Asn Pro He Val Asn Asn Pro 

660 665 670 

Ala Pro Val Thr Ser Asn Thr Asn Pro Leu Leu Gly Ser Thr Ser Thr 

675 680 685 

Ser Ala Ser Ala Ser Ala Ser Ala Ser Ala Leu He Ser Thr Lys Pro 

690 695 700 

Asn He Ala Ala Ala Gly Lys Arg Asp Gly Glu Thr Asn Gin Ser Gly 
705 710 715 720 

Gly Gly Ala Pro Thr Pro Leu Asn Thr Lys Tyr Arg Met Leu Thr He 

725 730 735 

Leu Asn Gly Gly Gly Ala Ser Ser Cys Thr He Thr Gly Leu Val Gin 

740 745 750 

Tyr Thr Leu Tyr Glu Phe Phe He Val Pro Phe Tyr Lys Ser Val Glu 

755 760 765 

Gly Lys Pro Ser Asn Ser Arg He Ala Arg Thr Leu Glu Asp Val Pro 

770 775 780 

Ser Glu Ala Pro Tyr Gly Met Glu Ala Leu Leu Leu Asn Ser Ser Ala 
785 790 795 800 

Val Phe Leu Lys Trp Lys Ala Pro Glu Leu Lys Asp Arg His Gly Val 
805 810 815 


i ) 


! ) 


Leu Leu Asn Tyr His Val lie Val Arg Gly lie Asp Thr Ala His Asn 

820 825 830 

Phe Ser Arg lie Leu Thr Asn Val Thr lie Asp Ala Ala Ser Pro Thr 

835 840 845 

Leu Val Leu Ala Asn Leu Thr Glu Gly Val Met Tyr Thr Val Gly Val 

850 855 860 

Ala Ala Gly Asn Asn Ala Gly Val Gly Pro Tyr Cys Val Pro Ala Thr 
865 870 875 880 

Leu Arg Leu Asp Pro lie Thr Lys Arg Leu Asp Pro Phe lie Asn Gin 

885 890 895 

Arg Asp His Val Asn Asp Val Leu Thr Gin Pro Trp Phe lie lie Leu 

900 905 910 

Leu Gly Ala lie Leu Ala Val Leu Met Leu Ser Phe Gly Ala Met Val 

915 920 925 

Phe Val Lys Arg Lys His Met Met Met Lys Gin Ser Ala Leu Asn Thr 

930 935 940 

Met Arg Gly Asn His Thr Ser Asp Val Leu Lys Met Pro Ser Leu Ser 
945 950 955 960 

Ala Arg Asn Gly Asn Gly Tyr Trp Leu Asp Ser Ser Thr Gly Gly Met 

965 970 975 

Val Trp Arg Pro Ser Pro Gly Gly Asp Ser Leu Glu Met Gin Lys Asp 

980 985 990 

His lie Ala Asp Tyr Ala Pro Val Cys Gly Ala Pro Gly Ser Pro Ala 

995 1000 1005 

Gly Gly Gly Thr Ser Ser Gly Gly Ser Gly Gly Ala Gly Ser Gly Ala 

1010 1015 1020 

Ser Gly Gly Asp Asp lie His Gly Gly His Gly Ser Glu Arg Asn Gin 
1025 1030 " 1035 1040 

Gin Arg Tyr Val Gly Glu Tyr Ser Asn lie Pro Thr Asp Tyr Ala Glu 

1045 1050 1055 

Val Ser Ser Phe Gly Lys Ala Pro Ser Glu Tyr Gly Arg His Gly Asn 

1060 1065 1070 

Ala Ser Pro Ala Pro Tyr Ala Thr Ser Ser lie Leu Ser Pro His Gin 

1075 1080 1085 

Gin Gin Gin Gin Gin Gin Pro Arg Tyr Gin Gin Arg Pro Val Pro Gly 

1090 1095 1100 

Tyr Gly Leu Gin Arg Pro Met His Pro His Tyr Gin Gin Gin Gin His 
1105 1110 1115 1120 

Gin Gin Gin Gin Ala Gin Gin Thr His Gin Gin His Gin Ala Leu Gin 

1125 1130 1135 

Gin His Gin Gin Leu Pro Pro Ser Asn lie Tyr Gin Gin Met Ser Thr 

1140 1145 1150 

Thr Ser Glu lie Tyr Pro Thr Asn Thr Gly Pro Ser Arg Ser Val Tyr 

1155 1160 1165 

Ser Glu Gin Tyr Tyr Tyr Pro Lys Asp Lys Gin Arg His lie His lie 

1170 1175 1180 

Thr Glu Asn Lys Leu Ser Asn Cys His Thr Tyr Glu Ala Ala Pro Gly 
1185 1190 1195 1200 

Ala Lys Gin Ser Ser Pro lie Ser Ser Gin Phe Ala Ser Val Arg Arg 

1205 1210 1215 

Gin Gin Leu Pro Pro Asn Cys Ser lie Gly Arg Glu Ser Ala Arg Phe 

1220 1225 1230 

Lys Val Leu Asn Thr Asp Gin Gly Lys Asn Gin Gin Asn Leu Leu Asp 
1235 1240 1245 


) 


Leu Asp Gly Ser Ser Met Cys Tyr Asn Gly Leu Ala Asp Ser Gly Cys 

1250 1255 1260 

Gly Gly Ser Pro Ser Pro Met Ala Met Leu Met Ser His Glu Asp Glu 
1265 1270 1275 1280 

His Ala Leu Tyr His Thr Ala Asp Gly Asp Leu Asp Asp Met Glu Arg 

1285 1290 1295 

Leu Tyr Val Lys Val Asp Glu Gin Gin Pro Pro Gin Gin Gin Gin Gin 

1300 1305 1310 

Leu He Pro Leu Val Pro Gin His Pro Ala Glu Gly His Leu Gin Ser 

1315 1320 1325 

Trp Arg Asn Gin Ser Thr Arg Ser Ser Arg Lys Asn Gly Gin Glu Cys 

1330 1335 1340 

He Lys Glu Pro Ser Glu Leu He Tyr Ala Pro Gly Ser Val Ala Ser 
1345 1350 1355 1360 

Glu Arg Ser Leu Leu Ser Asn Ser Gly Ser Gly Thr Ser Ser Gin Pro 

1365 1370 1375 

Ala Gly His Asn Val 
1380 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATGTACTATC TAGGTTTTTA CCACACTCAC ACACACACAC ACACATACAT AAATTTTGAT 60 
AAAATTCCTA ATGCCTCAAA TCTCGCTCCC GTGATAATCG AACATCCCAT CGATGTGGTG 120 
GTATCTAGGG GATCGCCAGC AACCCTCAAC TGTGGTGCAA AGCCATCTAC CGCCAAAATC 180 
ACATGGTACA AGGATGGACA GCCCGTAATC ACGAATAAGG AGCAAGTGAA CAGCCACCGG 240 
ATTGTTCTCG ACACGGGATC CCTGTTTCTT CTGAAAGTGA ATAGTGGAAA AAACGGAAAA 300 
GACAGCGATG CGGGAGCGTA CTATTGTGTG GCCAGCAACG AGCACGGAGA AGTGAAGTCG 360 
AACGAAGGAT CGTTAAAATT GGCGATGCTT CGCGAAGACT TTCGAGTTCG GCCAAGAACA 420 
GTTCAGGCTC TTGGTGGAGA GATGGCCGTT CTGGAATGCA GTCCGCCACG TGGATTCCCG 480 
GAGCCGGTTG TGAGCTGGCG GAAAGACGAC AAAGAGCTCC GAATTCAAGA CATGCCACGA 540 
TACACTCTAC ACTCTGACGG AAACCTCATC ATTGATCCGG TCGATCGAAG CGATTCTGGT 600 
ACTTATCAGT GTGTTGCCAA CAACATGGTC GGAGAACGGG TGTCCAATCC CGCAAGATTG 660 
AGTGTCTTTG AGAAACCAAA GTTTGAGCAA GAACCCAAGG ACATGACGGT CGACGTCGGA 720 
GCCGCAGTGC TGTTTGATTG TCGTGTGACT GGAGATCCTC AACCACAAAT TACGTGGAAA 780 
CGCAAAAATG AGCCGATGCC AGTTACACGT GCATACATTG CCAAGGATAA TCGGGGGTTG 840 
AGAATCGAAA GAGTTCAACC ATCAGACGAA GGTGAATACG TTTGCTATGC ACGAAATCCA 900 
GCGGGAACTC TTGAAGCATC TGCACATCTT CGTGTCCAGG CACCTCCATC CTTCCAGACA 960 
AAACCAGCAG ACCAGTCAGT TCCAGCTGGA GGCACGGCAA CTTTTGAATG CACCTTGGTC 1020 
GGTCAACCGA GTCCCGCCTA TTTTTGGAGC AAGGAAGGCC AACAGGATCT TCTTTTCCCA 1080 
AGTTATGTGT CCGCTGATGG TAGAACGAAA GTTTCACCAA CTGGAACATT GACAATTGAG 1140 
GAAGTTCGTC AAGTTGATGA GGGAGCTTAT GTGTGCGCTG GAATGAACTC GGCAGGAAGC 1200 
TCGTTGAGCA AGGCAGCTTT GAAAGCAACA TTTGAAACCA AAGGCCGTGT CCAAAAAAAA 1260 
AAGAGCAAAA TGGGCAAACA GAAACAAAAA AATGTTCAAT CAATTATCAA ATATTTAATT 132 0 
TCAGCCGTGA CCGGAAACAC ACCCGCCAAA CCACCACCAA CAATCGAGCA TGGTCATCAA 1380 
AATCAGACCC TTATGGTTGG ATCATCAGCC ATCCTTCCAT GTCAGGCTAG CGGAAAACCA 144 0 
ACTCCAGGAA TATCATGGCT CAGGGATGGG CTACCTATTG ACATTACAGA TAGTCGTATC 1500 
AGTCAACATT CAACGGGAAG TCTACATATT GCCGATTTAA AGAAACCTGA CACCGGAGTT 1560 
TACACTTGCA TTGCGAAGAA CGAGGATGGA GAGTCAACAT GGTCGGCATC TCTGACTGTT 162 0 


) 


} 


GAAGATCACA CTAGCAATGC ACAATTTGTT CGGATGCCGG ATCCATCGAA CTTCCCGTCT 1680 

TCTCCAACGC AACCCATTAT TGTCAATGTC ACTGATACCG AAGTAGAGCT CCACTGGAAT 174 0 

GCTCCCTCCA CATCTGGCGC AGGACCAATC ACTGGTTATA TCATTCAGTA CTACAGTCCA 1800 

GACCTCGGAC AGACGTGGTT TAACATTCCA GACTACGTGG CATCTACTGA ATATAGAATA 1860 

AAGGGTCTGA AACCATCTCA CTCGTATATG TTTGTGATTC GAGCAGAAAA TGAGAAAGGT 1920 

ATTGGAACGC CGAGTGTGTC GTCGGCTCTC GTTACCACTA GCAAGCCAGC AGCTCAAGTT 1980 

GCGCTTTCTG ACAAGAACAA AATGGACATG GCCATCGCTG AGAAGAGACT CACTTCGGAA 2 040 

CAACTCATAA AACTCGAGGA AGTGAAGACT ATTAATTCTA CGGCCGTTCG TTTGTTCTGG 2100 

AAGAAGAGGA AACTTGAAGA GCTGATTGAT GGTTACTACA TCAAGTGGAG AGGGCCTCCA 2160 

AGAACCAATG ATAATCAATA CGTGAATGTG ACCAGCCCTA GC AC CGAAAA CTATGTTGTT 2220 

TCAAATTTAA TGCCATTCAC CAACTATGAG TTTTTCGTGA TTCCTTATCA TTCCGGAGTT 2280 

CATAGTATTC ATGGAGCACC GAGTAATTCC ATGGACGTGT TGACCGCCGA AGCTCCACCT 234 0 

TCATTGCCAC CAGAGGATGT GCGAATCCGT ATGCTCAACC TGACCACTCT TCGTATCTCT 2400 

TGGAAAGCAC CAAAAGCCGA CGGCATCAAC GGAATTCTCA AAGGATTCCA AATTGTTATT 2460 

GTTGGTCAAG CGCCCAACAA CAATCGGAAC ATCACTACAA ACGAGAGAGC TGCCAGTGTT 2520 

ACTCTGTTCC ATTTAGTGAC TGGAATGACG TATAAAATTC GTGTAGCGGC TAGAAGCAAT 2580 

GGTGGAGTTG GAGTCTCACA TGGAACGAGT GAAGTCATCA TGAATCAAGA CACGCTGGAA 2640 

AAACACCTTG CTGCTCAACA AGAAAACGAA TCATTTTTGT ATGGGCTGAT CAATAAATCT 2700 

CATGTTCCTG TGATTGTCAT TGTTGCAATT CTGATTATTT TCGTAGTCAT CATTATAGCC 2760 

TATTGTTACT GGAGGAATAG CAGAAACAGT GATGGAAAGG ATCGAAGTTT TATAAAGATC 2820 

AATGATGGAA GTGTTCATAT GGCTTCGAAT AATCTTTGGG ATGTTGCACA AAATCCGAAT 2880 

CAGAATCCAA TGTACAACAC TGCTGGAAGA ATGACTATGA ACAATAGAAA TGGCCAGGCT 2940 

CTCTATTCGC TGACACCAAA TGCGCAAGAC TTTTTCAACA ATTGTGATGA CTACAGTGGA 3000 

ACGATGCACA GACCAGGATC CGAGCATCAC TATCATTATG CTCAACTGAC TGGCGGACCT 3060 

GGTAATGCGA TGTCTACTTT TTATGGAAAC CAATATCACG ATGATCCATC TCCATATGCC 3120 

ACCACAACAC TGGTCCTGTC GAACCAACAA CCAGCTTGGC TCAATGACAA AATGCTTCGC 3180 

GCGCCAGCAA TGCCAACAAA TCCCGTGCCA CCAGAGCCAC CGGCGCGATA TGCAGATCAT 3240 

ACCGCTGGAA GACGATCTCG ATCGAGCCGT GCATCCGATG GGAGAGGAAC TCTGAATGGC 33 00 

GGACTCCATC ACCGGACTAG CGGAAGTCAA CGGTCGGATA GTCCACCTCA CACAGATGTG 3360 

AGCTATGTTC AGCTTCACTC ATCCGATGGA ACTGGTAGTA GTAAGGAAAG AACTGGGGAG 3420 

CGGAGAACAC CACCGAATAA GACTCTGATG GACTTTATTC CGCCACCACC TTCCAATCCA 3480 

CCACCACCTG GAGGGCACGT TTATGACACA GCAACTAGGC GTCAGTTGAA TCGTGGAAGT 3540 

ACTCCACGAG AAGACACCTA CGATTCGGTC AGTGACGGAG CTTTTGCTCG GGTTGATGTG 3600 

AATGCAAGGC CAACGAGTCG GAATCGGAAT TTGGGAGGAA GGCCGCTGAA AGGGAAACGA 3660 

GACGACGATA GTCAGCGGTC TTCGTTGATG ATGGACGATG ATGGTGGATC TTCTGAAGCT 3720 

GACGGGGAGA ACTCTGAAGG AGACGTTCCG CGTGGAGGTG TTAGAAAAGC AGTTCCTCGA 3780 

ATGGGTATCT CTGCAAGTAC GCTGGCTCAT AGTTGTTACG GGACAAACGG CACTGCTCAA 3840 

CGATTCCGGT CAATTCCACG TAACAATGGA ATCGTCACAC AAGAACAAAC TTGA 38 94 


(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1297 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Tyr Tyr Leu Gly Phe Tyr His Thr His Thr His Thr His Thr Tyr 

15 10 15 

lie Asn Phe Asp Lys lie Pro Asn Ala Ser Asn Leu Ala Pro Val lie 

20 25 30 

lie Glu His Pro lie Asp Val Val Val Ser Arg Gly Ser Pro Ala Thr 

35 40 45 

Leu Asn Cys Gly Ala Lys Pro Ser Thr Ala Lys lie Thr Trp Tyr Lys 


50 
Asp Gly 
65 

He Val 

Lys Asn 

Asn Glu 

Met Leu 
130 
Gly Gly 
145 

Glu Pro 

Asp Met 

Pro Val 

Met Val 
210 
Lys Pro 
225 

Ala Ala 

He Thr 

He Ala 

Asp Glu 
290 
Glu Ala 
305 

Lys Pro 

Cys Thr 

Gly Gin 

Thr Lys 
370 
Val Asp 
385 

Ser Leu 

Val Gin 

Gin Ser 

Ala Lys 
450 
Met Val 
465 

Thr Pro 


Gin Pro 

Leu Asp 

Gly Lys 
100 
His Gly 
115 

Arg Glu 

Glu Met 

Val Val 

Pro Arg 
180 
Asp Arg 
195 

Gly Glu 

Lys Phe 

Val Leu 

Trp Lys 
260 
Lys Asp 
275 

Gly Glu 
Ser Ala 
Ala Asp 


Leu Val 
340 
Gin Asp 
355 

Val Ser 

Glu Gly 

Ser Lys 

Lys Lys 
420 
lie He 
435 

Pro Pro 
Gly Ser 
Gly He 


Val He 

70 
Thr Gly 
85 

Asp Ser 

Glu Val 

Asp Phe 

Ala Val 
150 
Ser Trp 
165 

Tyr Thr 

Ser Asp 

Arg Val 

Glu Gin 
230 
Phe Asp 
245 

Arg Lys 

Asn Arg 

Tyr Val 

His Leu 
310 
Gin Ser 
325 

Gly Gin 


55 

Thr Asn 

Ser Leu 

Asp Ala 

Lys Ser 
120 
Arg Val 
135 

Leu Glu 

Arg L Y S 

Leu His 

Ser Gly 
200 
Ser Asn 
215 

Glu Pro 

Cys Arg 

Asn Glu 

Gly Leu 
280 
Cys Tyr 
295 

Arg Val 
Val Pro 
Pro Ser 


Lys Glu Gin 
75 

Phe Leu Leu 
90 

Gly Ala Tyr 
105 

Asn Glu Gly 

Arg Pro Arg 

Cys Ser Pro 
155 

Asp Asp Lys 
170 

Ser Asp Gly 
185 

Thr Tyr Gin 

Pro Ala Arg 

Lys Asp Met 
235 

Val Thr Gly 

250 
Pro Met Pro 
265 

Arg He Glu 
Ala Arg Asn 


Leu Leu 

Pro Thr 

Ala Tyr 
390 
Ala Ala 
405 

Lys Ser 

Lys Tyr 

Pro Thr 

Ser Ala 
470 
Ser Trp 


Phe Pro 
360 
Gly Thr 
375 

Val Cys 

Leu Lys 

Lys Met 

Leu He 
440 
He Glu 
455 

He Leu 


Gin Ala Pro 
315 

Ala Gly Gly 

330 
Pro Ala Tyr 
345 

Ser Tyr Val 

Leu Thr He 

Ala Gly Met 
395 

Ala Thr Phe 

410 
Gly Lys Gin 
425 

Ser Ala Val 
His Gly His 


60 

Val Asn 

Lys Val 

Tyr Cys 

Ser Leu 
125 
Thr Val 
140 

Pro Arg 

Glu Leu 

Asn Leu 

Cys Val 
205 
Leu Ser 
220 

Thr Val 

Asp Pro 

Val Thr 

Arg Val 
285 
Pro Ala 
300 

Pro Ser 


Thr Ala 
Phe Trp 


Ser His Arg 
80 

Asn Ser Gly 
95 

Val Ala Ser 
110 

Lys Leu Ala 

Gin Ala Leu 

Gly Phe Pro 
160 

Arg He Gin 

175 
He He Asp 
190 

Ala Asn Asn 

Val Phe Glu 

Asp Val Gly 
240 

Gin Pro Gin 

255 
Arg Ala Tyr 
270 

Gin Pro Ser 

Gly Thr Leu 

Phe Gin Thr 
320 

Thr Phe Glu 

335 
Ser Lys Glu 
350 

Asp Gly Arg 


Ser Ala 
365 

Glu Glu Val Arg Gin 
380 

Asn Ser 


Glu Thr 

Lys Gin 

Thr Gly 
445 
Gin Asn 
460 

Ala Ser 


Pro Cys Gin 
475 

Leu Arg Asp Gly Leu Pro lie 


Ala Gly Ser 
400 

Lys Gly Arg 

415 
Lys Asn Val 
430 

Asn Thr Pro 

Gin Thr Leu 

Gly Lys Pro 
480 

Asp He Thr 


485 490 495 

Asp Ser Arg He Ser Gin His Ser Thr Gly Ser Leu His He Ala Asp 

500 505 510 

Leu Lys Lys Pro Asp Thr Gly Val Tyr Thr Cys He Ala Lys Asn Glu 

515 520 525 

Asp Gly Glu Ser Thr Trp Ser Ala Ser Leu Thr Val Glu Asp His Thr 

530 535 540 

Ser Asn Ala Gin Phe Val Arg Met Pro Asp Pro Ser Asn Phe Pro Ser 
545 550 555 560 

Ser Pro Thr Gin Pro He He Val Asn Val Thr Asp Thr Glu Val Glu 

565 570 575 

Leu His Trp Asn Ala Pro Ser Thr Ser Gly Ala Gly Pro He Thr Gly 

580 585 590 

Tyr He He Gin Tyr Tyr Ser Pro Asp Leu Gly Gin Thr Trp Phe Asn 

595 600 605 

He Pro Asp Tyr Val Ala Ser Thr Glu Tyr Arg He Lys Gly Leu Lys 

610 615 620 

Pro Ser His Ser Tyr Met Phe Val He Arg Ala Glu Asn Glu Lys Gly 
625 ' 630 635 640 

He Gly Thr Pro Ser Val Ser Ser Ala Leu Val Thr Thr Ser Lys Pro 

645 650 655 

Ala Ala Gin Val Ala Leu Ser Asp Lys Asn Lys Met Asp Met Ala He 

660 665 670 

Ala Glu Lys Arg Leu Thr Ser Glu Gin Leu He Lys Leu Glu Glu Val ^ 

675 680 685 

Lys Thr He Asn Ser Thr Ala Val Arg Leu Phe Trp Lys Lys Arg Lys 

690 695 700 

Leu Glu Glu Leu He Asp Gly Tyr Tyr He Lys Trp Arg Gly Pro Pro 
705 710 715 720 

Arg Thr Asn Asp Asn Gin Tyr Val Asn Val Thr Ser Pro Ser Thr Glu 

725 730 735 

Asn Tyr Val Val Ser Asn Leu Met Pro Phe Thr Asn Tyr Glu Phe Phe 

740 745 750 

Val He Pro Tyr His Ser Gly Val His Ser He His Gly Ala Pro Ser 

755 760 765 

Asn Ser Met Asp Val Leu Thr Ala Glu Ala Pro Pro Ser Leu Pro Pro 

770 775 780 

Glu Asp Val Arg He Arg Met Leu Asn Leu Thr Thr Leu Arg He Ser 
785 ^ 790 795 800 

Trp Lys Ala Pro Lys Ala Asp Gly He Asn Gly He Leu Lys Gly Phe 

805 810 815 

Gin He Val He Val Gly Gin Ala Pro Asn Asn Asn Arg Asn He Thr 

820 825 830 

Thr Asn Glu Arg Ala Ala Ser Val Thr Leu Phe His Leu Val Thr Gly 

835 840 845 

Met Thr Tyr Lys He Arg Val Ala Ala Arg Ser Asn Gly Gly Val Gly 

850 855 860 

Val Ser His Gly Thr Ser Glu Val He Met Asn Gin Asp Thr Leu Glu 
865 870 875 880 

Lys His Leu Ala Ala Gin Gin Glu Asn Glu Ser Phe Leu Tyr Gly Leu 

885 890 895 

He Asn Lys Ser His Val Pro Val He Val He Val Ala He Leu He 

900 905 910 

He Phe Val Val He He He Ala Tyr Cys Tyr Trp Arg Asn Ser Arg 


i 


915 920 925 

Asn Ser Asp Gly Lys Asp Arg Ser Phe He Lys He Asn Asp Gly Ser 

930 935 940 

Val His Met Ala Ser Asn Asn Leu Trp Asp Val Ala Gin Asn Pro Asn 
945 950 955 960 

Gin Asn Pro Met Tyr Asn Thr Ala Gly Arg Met Thr Met Asn Asn Arg 

965 970 975 

Asn Gly Gin Ala Leu Tyr Ser Leu Thr Pro Asn Ala Gin Asp Phe Phe 

980 985 990 

Asn Asn Cys Asp Asp Tyr Ser Gly Thr Met His Arg Pro Gly Ser Glu 

995 1000 1005 

His His Tyr His Tyr Ala Gin Leu Thr Gly Gly Pro Gly Asn Ala Met 

1010 1015 1020 

Ser Thr Phe Tyr Gly Asn Gin Tyr His Asp Asp Pro Ser Pro Tyr Ala 
1025 1030 1035 1040 

Thr Thr Thr Leu Val Leu Ser Asn Gin Gin Pro Ala Trp Leu Asn Asp 

1045 1050 1055 

Lys Met Leu Arg Ala Pro Ala Met Pro Thr Asn Pro Val Pro Pro Glu 

1060 1065 1070 

Pro Pro Ala Arg Tyr Ala Asp His Thr Ala Gly Arg Arg Ser Arg Ser 

1075 1080 1085 

Ser Arg Ala Ser Asp Gly Arg Gly Thr Leu Asn Gly Gly Leu His His 

1090 1095 1100 

Arg Thr Ser Gly Ser Gin Arg Ser Asp Ser Pro Pro His Thr Asp Val 
1105 1110 1115 1120 

Ser Tyr Val Gin Leu His Ser Ser Asp Gly Thr Gly Ser Ser Lys Glu 

1125 1130 . 1135 

Arg Thr Gly Glu Arg Arg Thr Pro Pro Asn Lys Thr Leu Met Asp Phe 

1140 1145 1150 

He Pro Pro Pro Pro Ser Asn Pro Pro Pro Pro Gly Gly His Val Tyr 

1155 1160 1165 

Asp Thr Ala Thr Arg Arg Gin Leu Asn Arg Gly Ser Thr Pro Arg Glu 

1170 H75 1180 

Asp Thr Tyr Asp Ser Val Scr Asp Gly Ala Phe Ala Arg Val Asp Val 
1185 1190 1195 1200 

Asn Ala Arg Pro Thr Ser Arg Asn Arg Asn Leu Gly Gly Arg Pro Leu 

1205 1210 1215 

Lys Gly Lys Arg Asp Asp Asp Ser Gin Arg Ser Ser Leu Met Met Asp 

1220 1225 1230 

Asp Asp Gly Gly Ser Ser Glu Ala Asp Gly Glu Asn Ser Glu Gly Asp 

1235 1240 1245 

Val Pro Arg Gly Gly Val Arg Lys Ala Val Pro Arg Met Gly He Ser 

1250 1255 1260 

Ala Ser Thr Leu Ala His Ser Cys Tyr Gly Thr Asn Gly Thr Ala Gin 
1265 1270 1275 1280 

Arg Phe Arg Ser He Pro Arg Asn Asn Gly He Val Thr Gin Glu Gin 
1285 1290 1295 

Thr 


INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 


i 


(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ATGAAATGGA AACATGTTCC TTTTTTGGTC ATGATATCAC TCCTCAGCTT ATCCCCAAAT 60 

CACCTGTTTC TGGCCCAGCT TATTCCAGAC CCTGAAGATG TAGAGAGGGG GAACGACCAC 12 0 

GGGACGCCAA TCCCCACCTC TGATAACGAT GACAATTCGC TGGGCTATAC AGGCTCCCGT 18 0 

CTTCGTCAGG AAGATTTTCC ACCTCGCATT GTTGAACACC CTTCAGACCT GATTGTCTCA 24 0 

AAAGGAGAAC CTGCAACTTT GAACTGCAAA GCTGAAGGCC GCCCCACACC CACTATTGAA 300 

TGGTACAAAG GGGGAGAGAG AGTGGAGACA GACAAAGATG ACCCTCGCTC ACACCGAATG 36 0 

TTGCTGCCGA GTGGATCTTT ATTTTTCTTA CGTATAGTAC ATGGACGGAA AAGTAGACCT 42 0 

GATGAAGGAG TCTATGTCTG TGTAGCAAGG AATTACCTTG GAGAGGCTGT GAGCCACAAT 480 
GCATCGCTGG AAGTAGCCAT ACTTCGGGAT GACTTCAGAC AAAACCCTTC GGATGTCATG 540 
GTTGCAGTAG GAGAGCCTGC AGTAATGGAA TGCCAACCTC CACGAGGCCA TCCTGAGCCC 6 00 

ACCATTTCAT GGAAGAAAGA TGGCTCTCCA CTGGATGATA AAGATGAAAG AATAACTATA 660 
CGAGGAGGAA AGCTCATGAT CACTTACACC CGTAAAAGTG ACGCTGGCAA ATATGTTTGT 720 
GTTGGTACCA ATATGGTTGG GGAACGTGAG AGTGAAGTAG CCGAGCTGAC TGTCTTAGAG 780 
AGACCATCAT TTGTGAAGAG ACCCAGTAAC TTGGCAGTAA CTGTGGATGA CAGTGCAGAA 840 
TTTAAATGTG AGGCCCGAGG TGACCCTGTA CCTACAGTAC GATGGAGGAA AGATGATGGA 900 
GAGCTGCCCA AATCCAGATA TGAAATCCGA GATGATCATA CCTTGAAAAT TAGGAAGGTG 960 

ACAGCTGGTG ACATGGGTTC ATACACTTGT GTTGCAGAAA ATATGGTGGG CAAAGCTGAA 1020 

GCATCTGCTA CTCTGACTGT TCAAGAACCT CCACATTTTG TTGTGAAACC CCGTGACCAG 1080 

GTTGTTGCTT TGGGACGGAC TGTAACTTTT CAGTGTGAAG CAACCGGAAA TCCTCAACCA 1140 

GCTATTTTCT GGAGGAGAGA AGGGAGTCAG AATCTACTTT TCTCATATCA ACCACCACAG 1200 

TCATCCAGCC GATTTTCAGT CTCCCAGACT GGCGACCTCA CAATTACTAA TGTCCAGCGA 1260 

TCTGATGTTG GTTATTACAT CTGCCAGACT TTAAATGTTG CTGGAAGCAT CATCACAAAG 1320 

GCATATTTGG AAGTTACAGA TGTGATTGCA GATCGGCCTC CCCCAGTTAT TCGACAAGGT 1380 

CCTGTGAATC AGACTGTAGC CGTGGATGGC ACTTTCGTCC TCAGCTGTGT GGCCACAGGC 1440 

AGTCCAGTGC CCACCATTCT GTGGAGAAAG GATGGAGTCC TCGTTTCAAC CCAAGACTCT 1500 

CGAATCAAAC AGTTGGAGAA TGGAGTACTG CAGATCCGAT ATGCTAAGCT GGGTGATACT 1560 

GGTCGGTACA CCTGCATTGC ATCAACCCCC AGTGGTGAAG CAACATGGAG TGCTTACATT 1620 

GAAGTTCAAG AATTTGGAGT TCCAGTTCAG CCTCCAAGAC CTACTGACCC AAATTTAATC 1680 

CCTAGTGCCC CATCAAAACC TGAAGTGACA GATGTCAGCA GAAATACAGT CACATTATCG 1740 

TGGCAACCAA ATTTGAATTC AGGAGCAACT CCAACATCTT ATATTATAGA AGCCTTCAGC 1800 

CATGCATCTG GTAGCAGCTG GCAGACCGTA GCAGAGAATG TGAAAACAGA AACATCTGCC 1860 

ATTAAAGGAC TCAAACCTAA TGCAATTTAC CTTTTCCTTG TGAGGGCAGC TAATGCATAT 1920 . 

GGAATTAGTG ATCCAAGCCA AATATCAGAT CCAGTGAAAA CACAAGATGT CCTACCAACA 1980 

AGTCAGGGGG TGGACCACAA GCAGGTCCAG AGAGAGCTGG GAAATGCTGT TCTGCACCTC 2040 

CACAACCCCA CCGTCCTTTC TTCCTCTTCC ATCGAAGTGC ACTGGACAGT AGATCAACAG 2100 

TCTCAGTATA TACAAGGATA TAAAATTCTC TATCGGCCAT CTGGAGCCAA CCACGGAGAA 2160 

TCAGACTGGT TAGTTTTTGA AGTGAGGACG CCAGCCAAAA ACAGTGTGGT AATCCCTGAT 2220 

CTCAGAAAGG GAGTCAACTA TGAAATTAAG GCTCGCCCTT TTTTTAATGA ATTTCAAGGA 22 80 

GCAGATAGTG AAATCAAGTT TGCCAAAACC CTGGAAGAAG CACCCAGTGC CCCACCCCAA 234 0 

GGTGTAACTG TATCCAAGAA TGATGGAAAC GGAACTGCAA TTCTAGTTAG TTGGCAGCCA 24 00 

CCTCCAGAAG ACACTCAAAA TGGAATGGTC CAAGAGTATA AGGTTTGGTG TCTGGGCAAT 2460 

GAAACTCGAT ACCACATCAA CAAAACAGTG GATGGTTCCA CCTTTTCCGT GGTCATTCCC 2520 

TTTCTTGTTC CTGGAATCCG ATACAGTGTG GAAGTGGCAG CCAGCACTGG GGCTGGGTCT 2580 

GGGGTAAAGA GTGAGCCTCA GTTCATCCAG CTGGATGCCC ATGGAAACCC TGTGTCACCT 264 0 

GAGGACCAAG TCAGCCTCGC TCAGCAGATT TCAGATGTGG TGAAGCAGCC GGCCTTCATA 2700 

GCAGGTATTG GAGCAGCCTG TTGGATCATC CTCATGGTCT TCAGCATCTG GCTTTATCGA 2760 

CACCGCAAGA AGAGAAACGG ACTTACTAGT ACCTACGCGG GTATCAGAAA AGTCCCGTCT 2 820 

TTTACCTTCA CACCAACAGT AACTTACCAG AGAGGAGGCG AAGCTGTCAG CAGTGGAGGG 2880 

AGGCCTGGAC TTCTCAACAT CAGTGAACCT GCCGCGCAGC CATGGCTGGC AGACACGTGG 2 94 0 

CCTAATACTG GCAACAACCA CAATGACTGC TCCATCAGCT GCTGCACGGC AGGCAATGGA 3000 

AACAGCGACA GCAACCTCAC TACCTACAGT CGCCCAGCTG ATTGTATAGC AAATTATAAC 3 060 


AACCAACTGG ATAACAAACA AACAAATCTG ATGCTCCCTG AGTCAACTGT TTATGGTGAT 
GTGGACCTTA GTAACAAAAT CAATGAGATG AAAACCTTCA ATAGCCCAAA TCTGAAGGAT 
GGGCGTTTTG TCAATCCATC AGGGCAGCCT ACTCCTTACG CCACCACTCA GCTCATCCAG 
TCAAACCTCA GCAACAACAT GAACAATGGC AGCGGGGACT CTGGCGAGAA GCACTGGAAA 
CCACTGGGAC AGCAGAAACA AGAAGTGGCA CCAGTTCAGT ACAACATCGT GGAGCAAAAC 
AAGCTGAACA AAGATTATCG AGCAAATGAC ACAGTTCCTC CAACTATCCC ATACAACCAA 
TCATACGACC AGAACACAGG AGGATCCTAC AACAGCTCAG ACCGGGGCAG TAGTACATCT 
GGGAGTCAGG GGCACAAGAA AGGGGCAAGA ACACCCAAGG TACCAAAACA GGGTGGCATG 
AACTGGGCAG ACCTGCTTCC TCCTCCCCCA GCACATCCTC CTCCACACAG CAATAGCGAA 
GAGTACAACA TTTCTGTAGA TGAAAGCTAT GACCAAGAAA TGCCATGTCC CGTGCCACCA 
GCAAGGATGT ATTTGCAACA AGATGAATTA GAAGAGGAGG AAGATGAACG AGGCCCCACT 
CCCCCTGTTC GGGGAGCAGC TTCTTCTCCA GCTGCCGTGT CCTATAGCCA TCAGTCCACT 
GCCACTCTGA CTCCCTCCCC ACAGGAAGAA CTCCAGCCCA TGTTACAGGA TTGTCCAGAG 
GAGACTGGCC ACATGCAGCA CCAGCCCGAC AGGAGACGGC AGCCTGTGAG TCCTCCTCCA 
CGACCACGGC CGATCTCCCC TCCACATACC TATGGCTACA TTTCAGGACC CCTGGTCTCA 
GATATGGATA CGGATGCGCC AGAAGAGGAA GAAGACGAAG CCGACATGGA GGTAGCCAAG 
ATGCAAACCA GAAGGCTTTT GTTACGTGGG CTTGAGCAGA CACCTGCCTC CAGTGTTGGG 
GACCTGGAGA GCTCTGTCAC GGGGTCCATG ATCAACGGCT GGGGCTCAGC CTCAGAGGAG 
GACAACATTT CCAGCGGACG CTCCAGTGTT AGTTCTTCGG ACGGCTCCTT TTTCACTGAT 
GCTGACTTTG CCCAGGCAGT CGCAGCAGCG GCAGAGTATG CTGGTCTGAA AGTAGCACGA 
CGGCAAATGC AGGATGCTGC TGGCCGTCGA CATTTTCATG CGTCTCAGTG CCCTAGGCCC 
ACAAGTCCCG TGTCTACAGA CAGCAACATG AGTGCCGCCG TAATGCAGAA AACCAGACCA 
GCCAAGAAAC TGAAACACCA GCCAGGACAT CTGCGCAGAG AAACCTACAC AGATGATCTT 
CCACCACCTC CTGTGCCGCC ACCTGCTATA AAGTCACCTA CTGCCCAATC CAAGACACAG 
CTGGAAGTAC GACCTGTAGT GGTGCCAAAA CTCCCTTCTA TGGATGCAAG AACAGACAGA 
TCATCAGACA GAAAAGGAAG CAGTTACAAG GGGAGAGAAG TGTTGGATGG AAGACAGGTT 
GTTGACATGC GAACAAATCC AGGTGATCCC AGAGAAGCAC AGGAACAGCA AAATGACGGG 
AAAGGACGTG GAAACAAGGC AGCAAAACGA GACCTTCCAC CAGCAAAGAC TCATCTCATC 
CAAGAGGATA TTCTACCTTA TTGTAGACCT ACTTTTCCAA CATCAAATAA TCCCAGAGAT 
CCCAGTTCCT CAAGCTCAAT GTCATCAAGA GGATCAGGAA GCAGACAAAG AGAACAAGCA 
' AATGTAGGTC GAAGAAATAT TGCAGAAATG CAGGTACTTG GAGGATATGA AAGAGGAGAA 
GATAATAATG AAGAATTAGA GGAAACTGAA AGCTGA 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1651 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 


Met 

Lys 

Trp 

Lys 

His 

Val 

Pro 

Phe 

Leu Val Met He Ser Leu Leu Ser 

1 




5 




10 15 

Leu 

Ser 

Pro 

Asn 

His 

Leu 

Phe 

Leu 

Ala Gin Leu He Pro Asp Pro Glu 




20 





25 30 

Asp 

Val 

Glu 

Arg 

Gly 

Asn 

Asp 

His 

Gly Thr Pro He Pro Thr Ser Asp 



35 





40 

45 

Asn 

Asp 

Asp 

Asn 

Ser 

Leu 

Gly 

Tyr 

Thr Gly Ser Arg Leu Arg Gin Glu 


50 





55 


60 

Asp 

Phe 

Pro 

Pro 

Arg 

He 

Val 

Glu 

His Pro Ser Asp Leu He Val Ser 

65 





70 



75 80 

Lys 

Gly 

Glu 

Pro 

Ala 

Thr 

Leu 

Asn 

Cys Lys Ala Glu Gly Arg Pro Thr 




85 




90 95 

Pro 

Thr 

He 

Glu 

Trp 

Tyr 

Lys 

Gly 

Gly Glu Arg Val Glu Thr Asp Lys 


3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4956 
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100 105 HO 

Asp Asp Pro Arg Ser His Arg Met Leu Leu Pro Ser Gly Ser Leu Phe 

115 120 125 

Phe Leu Arg He Val His Gly Arg Lys Ser Arg Pro Asp Glu Gly Val 

130 135 140 

Tyr Val Cys Val Ala Arg Asn Tyr Leu Gly Glu Ala Val Ser His Asn 
145 150 155 160 

Ala Ser Leu Glu Val Ala lie Leu Arg Asp Asp Phe Arg Gin Asn Pro 

165 170 175 

Ser Asp Val Met Val Ala Val Gly Glu Pro Ala Val Met Glu Cys Gin 

180 185 190 

Pro Pro Arg Gly His Pro Glu Pro Thr He Ser Trp Lys Lys Asp Gly 

195 200 205 

Ser Pro Leu Asp Asp Lys Asp Glu Arg He Thr He Arg Gly Gly Lys 

210 215 220 

Leu Met He Thr Tyr Thr Arg Lys Ser Asp Ala Gly Lys Tyr Val Cys 
225 230 235 240 

Val Gly Thr Asn Met Val Gly Glu Arg Glu Ser Glu Val Ala Glu Leu 

245 250 255 

Thr Val Leu Glu Arg Pro Ser Phe Val Lys Arg Pro Ser Asn Leu Ala 

260 265 270 

Val Thr Val Asp Asp Ser Ala Glu Phe Lys Cys Glu Ala Arg Gly Asp 

275 280 285 

Pro Val Pro Thr Val Arg Trp Arg Lys Asp Asp Gly Glu Leu Pro Lys 

290 295 300 

Ser Arg Tyr Glu He Arg Asp Asp His Thr Leu Lys He Arg Lys Val 
305 " 310 315 320 

Thr Ala Gly Asp Met Gly Ser Tyr Thr Cys Val Ala Glu Asn Met Val 

325 330 335 

Gly Lys Ala Glu Ala Ser Ala Thr Leu Thr Val Gin Glu Pro Pro His 

340 345 350 

Phe Val Val Lys Pro Arg Asp Gin Val Val Ala Leu Gly Arg Thr Val 

355 360 365 

Thr Phe Gin Cys Glu Ala Thr Gly Asn Pro Gin Pro Ala He Phe Trp 

370 375 380 

Arg Arg Glu Gly Ser Gin Asn Leu Leu Phe Ser Tyr Gin Pro Pro Gin 
385 390 395 400 

Ser Ser Ser Arg Phe Ser Val Ser Gin Thr Gly Asp Leu Thr He Thr 

405 410 415 

Asn Val Gin Arg Ser Asp Val Gly Tyr Tyr He Cys Gin Thr Leu Asn 

420 425 430 

Val Ala Gly Ser He He Thr Lys Ala Tyr Leu Glu Val Thr Asp Val 

435 440 445 

He Ala Asp Arg Pro Pro Pro Val He Arg Gin Gly Pro Val Asn Gin 

450 455 460 

Thr Val Ala Val Asp Gly Thr Phe Val Leu Ser Cys Val Ala Thr Gly 
465 470 475 480 

Ser Pro Val Pro Thr He Leu Trp Arg Lys Asp Gly Val Leu Val Ser 

485 490 495 

Thr Gin Asp Ser Arg He Lys Gin Leu Glu Asn Gly Val Leu Gin He 

500 505 510 

Arg Tyr Ala Lys Leu Gly Asp Thr Gly Arg Tyr Thr Cys He Ala Ser 

515 520 525 

Thr Pro Ser Gly Glu Ala Thr Trp Ser Ala Tyr lie Glu Val Gin Glu 


530 535 540 

Phe Gly Val Pro Val Gin Pro Pro Arg Pro Thr Asp Pro Asn Leu He 
545 550 555 560 

Pro Ser Ala Pro Ser Lys Pro Glu Val Thr Asp Val Ser Arg Asn Thr 

565 570 575 

Val Thr Leu Ser Trp Gin Pro Asn Leu Asn Ser Gly Ala Thr Pro Thr 

580 585 590 

Ser Tyr He He Glu Ala Phe Ser His Ala Ser Gly Ser Ser Trp Gin 

595 600 605 

Thr Val Ala Glu Asn Val Lys Thr Glu Thr Ser Ala He Lys Gly Leu 

610 615 620 

Lys Pro Asn Ala He Tyr Leu Phe Leu Val Arg Ala Ala Asn Ala Tyr 
625 630 635 640 

Gly He Ser Asp Pro Ser Gin He Ser Asp Pro Val Lys Thr Gin Asp 

645 650 655 

Val Leu Pro Thr Ser Gin Gly Val Asp His Lys Gin Val Gin Arg Glu 

660 665 670 

Leu Gly Asn Ala Val Leu His Leu His Asn Pro Thr Val Leu Ser Ser 

675 680 685 

Ser Ser He Glu Val His Trp Thr Val Asp Gin Gin Ser Gin Tyr lie 

690 695 700 

Gin Gly Tyr Lys He Leu Tyr Arg Pro Ser Gly Ala Asn His Gly Glu 
705 * 710 715 720 

Ser Asp Trp Leu Val Phe Glu Val Arg Thr Pro Ala Lys Asn Ser Val 

725 730 735 

Val He Pro Asp Leu Arg Lys Gly Val Asn Tyr Glu He Lys Ala Arg 

740 745 750 

Pro Phe Phe Asn Glu Phe Gin Gly Ala Asp Ser Glu He Lys Phe Ala 

755 760 765 

Lys Thr Leu Glu Glu Ala Pro Ser Ala Pro Pro Gin Gly Val Thr Val 

770 775 780 

Ser Lys Asn Asp Gly Asn Gly Thr Ala He Leu Val Ser Trp Gin Pro 
785 790 795 800 

Pro Pro Glu Asp Thr Gin Asn Gly Met Val Gin Glu Tyr Lys Val Trp 

805 810 815 

Cys Leu Gly Asn Glu Thr Arg Tyr His He Asn Lys Thr Val Asp Gly 

820 825 830 

Ser Thr Phe Ser Val Val He Pro Phe Leu Val Pro Gly He Arg Tyr 

835 840 845 

Ser Val Glu Val Ala Ala Ser Thr Gly Ala Gly Ser Gly Val Lys Ser 

850 855 860 

Glu Pro Gin Phe He Gin Leu Asp Ala His Gly Asn Pro Val Ser Pro 
865 870 875 880 

Glu Asp Gin Val Ser Leu Ala Gin Gin He Ser Asp Val Val Lys Gin 

885 890 895 

Pro Ala Phe He Ala Gly He Gly Ala Ala Cys Trp He He Leu Met 

900 905 910 

Val Phe Ser He Trp Leu Tyr Arg His Arg Lys Lys Arg Asn Gly Leu 

915 920 925 

Thr Ser Thr Tyr Ala Gly He Arg Lys Val Pro Ser Phe Thr Phe Thr 

930 935 940 

Pro Thr Val Thr Tyr Gin Arg Gly Gly Glu Ala Val Ser Ser Gly Gly 
945 950 955 960 

Arg Pro Gly Leu Leu Asn He Ser Glu Pro Ala Ala Gin Pro Trp Leu 


965 970 975 

Ala Asp Thr Trp Pro Asn Thr Gly Asn Asn His Asn Asp Cys Ser He 

980 985 990 

Ser Cys Cys Thr Ala Gly Asn Gly Asn Ser Asp Ser Asn Leu Thr Thr 

995 1000 1005 

Tyr Ser Arg Pro Ala Asp Cys He Ala Asn Tyr Asn Asn Gin Leu Asp 

1010 1015 1020 

Asn Lys Gin Thr Asn Leu Met Leu Pro Glu Ser Thr Val Tyr Gly Asp 
1025 1030 1035 1040 

Val Asp Leu Ser Asn Lys He Asn Glu Met Lys Thr Phe Asn Ser Pro 

1045 " 1050 1055 

Asn Leu Lys Asp Gly Arg Phe Val Asn Pro Ser Gly Gin Pro Thr Pro 

1060 1065 1070 

Tyr Ala Thr Thr Gin Leu He Gin Ser Asn Leu Ser Asn Asn Met Asn 

1075 1080 1085 

Asn Gly Ser Gly Asp Ser Gly Glu Lys His Trp Lys Pro Leu Gly Gin 

1090 1095 HOO 

Gin Lys Gin Glu Val Ala Pro Val Gin Tyr Asn He Val Glu Gin Asn 
1105 HIO 1115 H20 

Lys Leu Asn Lys Asp Tyr Arg Ala Asn Asp Thr Val Pro Pro Thr He 

1125 H30 1135 

Pro Tyr Asn Gin Ser Tyr Asp Gin Asn Thr Gly Gly Ser Tyr Asn Ser 

1140 H45 1150 

Ser Asp Arg Gly Ser Ser Thr Ser Gly Ser Gin Gly His Lys Lys Gly 

1155 H60 H65 

Ala Arg Thr Pro Lys Val Pro Lys Gin Gly Gly Met Asn Trp Ala Asp 

1170 H75 H80 

Leu Leu Pro Pro Pro Pro Ala His Pro Pro Pro His Ser Asn Ser Glu 
1185 H90 H95 1200 

Glu Tyr Asn He Ser Val Asp Glu Ser Tyr Asp Gin Glu Met Pro Cys 

1205 1210 1215 

Pro Val Pro Pro Ala Arg Met Tyr Leu Gin Gin Asp Glu Leu Glu Glu 

1220 1225 1230 

Glu Glu Asp Glu Arg Gly Pro Thr Pro Pro Val Arg Gly Ala Ala Ser 

1235 1240 1245 

Ser Pro Ala Ala Val Ser Tyr Ser His Gin Ser Thr Ala Thr Leu Thr 

1250 1255 1260 

Pro Ser Pro Gin Glu Glu Leu Gin Pro Met Leu Gin Asp Cys Pro Glu 
1265 1270 1275 1280 

Glu Thr Gly His Met Gin His Gin Pro Asp Arg Arg Arg Gin Pro Val 

1285 1290 1295 

Ser Pro Pro Pro Pro Pro Arg Pro He Ser Pro Pro His Thr Tyr Gly 

1300 1305 1310 

Tyr He Ser Gly Pro Leu Val Ser Asp Met Asp Thr Asp Ala Pro Glu 

1315 1320 1325 

Glu Glu Glu Asp Glu Ala Asp Met Glu Val Ala Lys Met Gin Thr Arg 

1330 1335 1340 

Arg Leu Leu Leu Arg Gly Leu Glu Gin Thr Pro Ala Ser Ser Val Gly 
1345 1350 1355 1360 

Asp Leu Glu Ser Ser Val Thr Gly Ser Met He Asn Gly Trp Gly Ser 

1365 1370 1375 

Ala Ser Glu Glu Asp Asn He Ser Ser Gly Arg Ser Ser Val Ser Ser 

1380 1385 1390 

Ser Asp Gly Ser Phe Phe Thr Asp Ala Asp Phe Ala Gin Ala Val Ala 


I ) 
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1395 1400 1405 

Ala Ala Ala Glu Tyr Ala Gly Leu Lys Val Ala Arg Arg Gin Met Gin 

1410 1415 1420 

Asp Ala Ala Gly Arg Arg His Phe His Ala Ser Gin Cys Pro Arg Pro 
1425 1430 1435 1440 

Thr Ser Pro Val Ser Thr Asp Ser Asn Met Ser Ala Ala Val Met Gin 

1445 1450 1455 

Lys Thr Arg Pro Ala Lys Lys Leu Lys His Gin Pro Gly His Leu Arg 

1460 1465 1470 

Arg Glu Thr Tyr Thr Asp Asp Leu Pro Pro Pro Pro Val Pro Pro Pro 

1475 1480 1485 

Ala lie Lys Ser Pro Thr Ala Gin Ser Lys Thr Gin Leu Glu Val Arg 

1490 1495 1500 

Pro Val Val Val Pro Lys Leu Pro Ser Met Asp Ala Arg Thr Asp Arg 
1505 1510 1515 1520 

Ser Ser Asp Arg Lys Gly Ser Ser Tyr Lys Gly Arg Glu Val Leu Asp 

1525 1530 1535 

Gly Arg Gin Val Val Asp Met Arg Thr Asn Pro Gly Asp Pro Arg Glu 

1540 1545 1550 

Ala Gin Glu Gin Gin Asn Asp Gly Lys Gly Arg Gly Asn Lys Ala Ala 

1555 1560 1565 

Lys Arg Asp Leu Pro Pro Ala Lys Thr His Leu lie Gin Glu Asp lie 

1570 1575 1580 

Leu Pro Tyr Cys Arg Pro Thr Phe Pro Thr Ser Asn Asn Pro Arg Asp 
1585 1590 1595 1600 

Pro Ser Ser Ser Ser Ser Met Ser Ser Arg Gly Ser Gly Ser Arg Gin 

1605 1610 1615 

Arg Glu Gin Ala Asn Val Gly Arg Arg Asn He Ala Glu Met Gin Val 

1620 1625 1630 

Leu Gly Gly Tyr Glu Arg Gly Glu Asp Asn Asn Glu Glu Leu Glu Glu 

1635 1640 1645 

Thr Glu Ser 
1650 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 855.. 1187 

(D) OTHER INFORMATION: /note= "N signifies gap in sequence" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CAGATTGTTG CTCAAGGTCG AACAGTGACA TTTCCCTGTG AAACTAAAGG AAACCCACAG 60 
CCAGCTGTTT TTTGGCAGAA AGAAGGCAGC CAGAACCTAC TTTTCCCAAA CCAACCCCAG 120 
CAGCCCAACA GTAGATGCTC AGTGTCACCA ACTGGAGACC TCACAATCAC CAACATTCAA 180 
CGTTCCGACG CGGGTTACTA CATCTGCCAG GCTTTAACTG TGGCAGGAAG CATTTTAGCA 240 
AAAGCTCAAC TGGAGGTTAC TGATGTTTTG ACAGATAGAC CTCCACCTAT AATTCTACAA 300 
GGCCCAGCCA ACCAAACGCT GGCAGTGGAT GGTACAGCGT TACTGAAATG TAAAGCCACT 360 
GGTGATCCTC TTCCTGTAAT TAGCTGGTTA AAGGAGGGAT TTACTTTTCC GGGTAGAGAT 420 
CCAAGAGCAA CAATTCAAGA GCAAGGCACA CTGCAGATTA AGAATTTACG GATTTCTGAT 480 


i 


ACTGGCACTT ATACTTGTGT GGCTACAAGT 
CTGGATGTGA CAGAGTCTGG AGCAACAATC 
GGGCCACCAT CCAAACCGCA AGTCACTGAT 
CAGCCAGGTA CCCCTGGAAC CCTTCCAGCA 
TCAGTGAGCA ACAGCTGGCA GACCGTGGCA 
AGAGGACTGC GGCCCAATAC AATCTACTTA 
TCAGTGACCC AAGTNAAACC ACAGAAAAAC 
CCTCCCCCCC CAGTCCAGCC CCTTCCTGGC 
CAAGAAAATG GCTATGACAG TGATAGCTGG 
CACCAAGGTC TGGAAGATGA ACTGGAAGAA 
CGAGGCGTGG CTTCTTCTCC TGCTATCTCC 
CCATCCCCAC GGGAAGAGAT GCAACCCATG 
AAGACCTCGA CCTACCAGCC CATTTTCTAC 
AAGTCAGAGG CCTCGGCCCA CTAAAAAACA 

(2) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 434 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
( ix) FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 285.. 396 

(D) OTHER INFORMATION: /note= "Xaa signifies gap in sequence" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 


Gin 

He 

Val 

Ala 

Gin 

Gly 

Arg 

Thr 

Val Thr 

Phe 

Pro 

Cys 

Glu 

Thr 

Lys 

1 




5 




10 





15 


Gly 

Asn 

Pro 

Gin 

Pro 

Ala 

Val 

Phe 

Trp Gin 

Lys 

Glu 

Gly 

Ser 

Gin 

Asn 




20 





25 




30 



Leu 

Leu 

Phe 

Pro 

Asn 

Gin 

Pro 

Gin 

Gin Pro 

Asn 

Ser 

Arg 

Cys 

Ser 

Val 



35 





40 




45 




Ser 

Fro 

Thr 

Gly 

Asp 

Leu 

Thr 

He 

Thr Asn 

He 

Gin 

Arg 

Ser 

Asp 

Ala 


50 





55 




60 





Gly 

Tyr 

Tyr 

He 

Cys 

Gin 

Ala 

Leu 

Thr Val 

Ala 

Gly 

Ser 

He 

Leu 

Ala 

65 





70 




75 





80 

Lys 

Ala 

Gin 

Leu 

Glu 

Val 

Thr 

Asp 

Val Leu 

Thr 

Asp 

Arg 

Pro 

Pro 

Pro 





85 




90 





95 


He 

He 

Leu 

Gin 

Gly 

Pro 

Ala 

Asn 

Gin Thr 

Leu 

Ala 

Val 

Asp 

Gly 

Thr 




100 





105 




110 



Ala 

Leu 

Leu 

Lys 

Cys 

Lys 

Ala 

Thr 

Gly Asp 

Pro 

Leu 

Pro 

Val 

He 

Ser 



115 





120 




125 




Trp 

Leu 

Lys 

Glu 

Gly 

Phe 

Thr 

Phe 

Pro Gly 

Arg 

Asp 

Pro 

Arg 

Ala 

Thr 


130 





135 




140 





He 

Gin 

Glu 

Gin 

Gly 

Thr 

Leu 

Gin 

He Lys 

Asn 

Leu 

Arg 

He 

Ser 

Asp 

145 





150 




155 





160 

Thr 

Gly 

Thr 

Tyr 

Thr 

Cys 

Val 

Ala 

Thr Ser 

Ser 

Ser 

Gly 

Glu 

Ala 

Ser 





165 




170 





175 


Trp 

Ser 

Ala 

Val 

Leu 

Asp 

Val 

Thr 

Glu Ser 

Gly 

Ala 

Thr 

He 

Ser 

Lys 




180 





185 




190 



Asn 

Tyr 

Asp 

Leu 

Ser 

As P 

Leu 

Pro 

Gly Pro 

Pro 

Ser 

Lys 

Pro 

Gin 

Val 



195 





200 




205 




Thr 

Asp 

Val 

Thr 

Lys 

Asn 

Ser 

Val 

Thr Leu 

Ser 

Trp 

Gin 

Pro 

Gly 

Thr 


TCAAGTGGAG 

AGGCTTCCTG 

GAGTGCAGTG 

540 

AGTAAAAACT 

ATGATTTAAG 

TGACCTGCCA 

600 

GTTACTAAGA 

ACAGTGTCAC 

CTTGTCCTGG 

660 

AGTGCATATA 

TCATTGAGGC 

TTTCAGCCAA 

720 

AACCATGTAA 

AGACCACCCT 

CTATACTGTA 

780 

TTCATGGTCA 

GAGCGATCAA 

CCCCAAGGTY 

840 

AATGGATCCA 

CTTGGGCCAA 

TGTCCCTCTA 

900 

ACGGAGCTGG 

AACACTATGC 

AGTGGAACAA 

960 

TGCCCACCAT 

TGCCAGTACA 

AACTTACTTA 

1020 

GATGATGATA 

GGGTCCCAAC 

ACCTCCTGTT 

1080 

TTTGGACAGC 

AGTCCACTGC 

AACTCTTACT 

1140 

CTGCAGGCTT 

CACCTNTTTA 

CCTCCTCTCA 

1200 

TGACAGTAAC 

ACCAGTGCAG 

CCCTGAGTCA 

1260 

CAAGGGAGGG 



1300 



210 



215 



220 




Pro 

Gly 

Thr 

Leu Pro Ala 

Ser 

Ala 

Tyr 

He He Glu Ala 

Phe 

Ser 

Gin 

225 


230 




235 . 



240 

Ser 

Val 

Ser 

Asn Ser Trp 

Gin 

Thr 

Val 

Ala Asn His Val 

Lys 

Thr 

Thr 




245 




250 


255 


Leu 

Tyr 

Thr 

Val Arg Gly 

Leu 

Arg 

Pro 

Asn Thr He Tyr 

Leu 

Phe 

Met 




260 



265 


270 



Val 

Arg 

Ala 

He Asn Pro 

Lys 

Val 

Ser 

Val Thr Gin Xaa 

Lys 

Pro 

Gin 



275 



280 


285 




Lys 

Asn 

Asn 

Gly Ser Thr 

Trp 

Ala 

Asn 

Val Pro Leu Pro 

Pro 

Pro 

Pro 


290 



295 



300 




Val 

Gin 

Pro 

Leu Pro Gly 

Thr 

Glu 

Leu 

Glu His Tyr Ala 

Val 

Glu 

Gin 

305 



310 




315 



320 

Gin 

Glu 

Asn 

Gly Tyr Asp 

Ser 

Asp 

Ser 

Trp Cys Pro Pro 

Leu 

Pro 

Val 




325 




330 


335 


Gin 

Thr 

Tyr 

Leu His Gin 

Gly 

Leu 

Glu 

Asp Glu Leu Glu 

Glu 

Asp 

Asp 




340 



345 


350 



Asp 

Arg 

Val 

Pro Thr Pro 

Pro 

Val 

Arg 

Gly Val Ala Ser 

Ser 

Pro 

Ala 



355 



360 


365 




He 

Ser 

Phe 

Gly Gin Gin 

Ser 

Thr 

Ala 

Thr Leu Thr Pro 

Ser 

Pro 

Arg 


370 



375 



380 




Glu 

Glu 

Met 

Gin Pro Met 

Leu 

Gin 

Ala 

Ser Pro Xaa Phe 

Thr 

Ser 

Ser 

385 



390 




395 



400 

Gin 

Arg 

Pro 

Arg Pro Thr 

Ser 

Pro 

Phe 

Ser Thr Asp Ser 

Asn 

Thr 

Ser 




405 




410 


415 


Ala 

Ala 

Leu 

Ser Gin Ser 

Gin 

Arg 

Pro 

Arg Pro Thr Lys 

Lys 

His 

Lys 




420 



425 


430 



Gly 

Gly 











(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE3S : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCCCAGGCAG TTGCTGCAGC TGCGGAGTAT GCGGGCCTGA AAGTGGCTCG CCGCCAAATG 60 
CAAGATGCTG CTGGCCGCCG CCACTTCCAT GCCTCTCAGT GCCCAAGGCC CACGAGTCCT 120 
GTGTCCACAG ACAGCAACAT GAGTGCTGTT GTGATCCAGA AAGCCAGACC CGCCAAGAAG 180 
CAGAAACACC AGCCAGGACA TCTGCGCAGG GAAGCCTACG CAGATGATCT TCCACCCCCT 24 0 

CCAGTGCCAC CACCTGCTAT AAAATCGCCC ACTGTCCAGT CCAAGGCACA GCTGGAGGTA 3 00 

CGGCCTGTCA TGGTGCCAAA ACTCGCGTCT ATAGAAGCAA GGACAGATAG ATCGTCAGAC 360 
AGAAAAGGAG GCAGTTACAA GGGGAGAGAA GCTCTGGATG GAAGACAAGT CACTGACCTG 42 0 

CGAACAAATC CAAGTGACCC CAGA 444 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 


J 


Ala 

Gin 

Ala 

Val 

Ala 

Ala 

Ala 

Ala 

Glu Tyr Ala Gly 

Leu 

Lys 

Val 

Ala 

1 




5 




10 



15 


Arg 

Arg 

Gin 

Met 

Gin 

Asp 

Ala 

Ala 

Gly Arg Arg His 

Phe 

His 

Ala 

Ser 




20 





25 


30 



Gin 

Cys 

Pro 

Arg 

Pro 

Thr 

Ser 

Pro 

Val Ser Thr Asp 

Ser 

Asn 

Met 

Ser 



35 





40 


45 




Ala 

Val 

Val 

He 

Gin 

Lys 

Ala 

Arg 

Pro Ala Lys Lys 

Gin 

Lys 

His 

Gin 


50 





55 


60 





Pro 

Gly 

His 

Leu 

Arg 

Arg 

Glu 

Ala 

Tyr Ala Asp Asp 

Leu 

Pro 

Pro 

Pro 

65 





70 



75 




80 

Pro 

Val 

Pro 

Pro 

Pro 

Ala 

He 

Lys 

Ser Pro Thr Val 

Gin 

Ser 

Lys 

Ala 





85 




90 



95 


Gin 

Leu 

Glu 

Val 

Arg 

Pro 

Val 

Met 

Val Pro Lys Leu 

Ala 

Ser 

He 

Glu 




100 





105 


110 



Ala 

Arg 

Thr 

Asp 

Arg 

Ser 

Ser 

Asp 

Arg Lys Gly Gly 

Ser 

Tyr 

Lys Gly 



115 





120 


125 




Arg 

Glu 

Ala 

Leu 

Asp 

Gly 

Arg 

Gin 

Val Thr Asp Leu 

Arg 

Thr 

Asn 

Pro 


130 





135 


140 





Ser 

Asp 

Pro 

Arg 











145 


(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 909 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 


Met 

Phe 

Asn 

Arg 

Lys 

Thr 

Leu 

Leu 

Cys 

Thr 

He Leu Leu 

Val 

Leu 

Gin 

1 




5 





10 



15 


Ala 

Val 

He 

Arg 

Ser 

Phe 

Cys 

Glu 

Asp 

Ala 

Ser Asn Leu 

Ala 

Pro 

Val 




20 





25 



30 



He 

He 

Glu 

His 

Pro 

He 

Asp 

Val 

Val 

Val 

Ser Arg Gly 

Ser 

Fro 

Ala 



35 





40 



45 




Thr 

Leu 

Asn 

Cys 

Gly 

Ala 

Lys 

Pro 

Ser 

Thr 

Ala Lys He 

Thr 

Trp 

Tyr 


50 





55 




60 




Lys 

Asp 

Gly 

Gin 

Pro 

Val 

He 

Thr 

Asn 

Lys 

Glu Gin Val 

Asn 

Ser 

His 

65 





70 





75 



80 

Arg 

He 

Val 

Leu 

Asp 

Thr 

Gly 

Ser 

Leu 

Phe 

Leu Leu Lys 

Val 

Asn 

Ser 





85 





90 



95 


Gly 

Lys 

Asn 

Gly 

Lys 

Asp 

Ser 

Asp 

Ala 

Gly 

Ala Tyr Tyr 

Cys 

Val 

Ala 




100 





105 



110 



Ser 

Asn 

Glu 

His 

Gly 

Glu 

Val 

Lys 

Ser 

Asn 

Glu Gly Ser 

Leu 

Lys 

Leu 



115 





120 



125 




Ala 

Met 

Leu 

Arg 

Glu 

Asp 

Phe 

Arg 

Val 

Arg 

Pro Arg Thr Val 

Gin 

Ala 


130 





135 




140 




Leu 

Gly 

Gly 

Glu 

Met 

Ala 

Val 

Leu 

Glu 

Cys 

Ser Pro Pro Arg 

Gly 

Phe 

145 





150 





155 



160 

Pro 

Glu 

Pro 

Val 

Val 

Ser 

Trp 

Arg 

Lys 

Asp 

Asp Lys Glu 

Leu 

Arg 

He 





165 





170 



175 


Gin 

Asp 

Met 

Pro 

Arg 

Tyr 

Thr 

Leu 

His 

Ser 

Asp Gly Asn Leu 

lie 

He 




180 





185 



190 




Asp Pro Val Asp Arg Ser Asp Ser Gly. Thr Tyr Gin Cys Val Ala Asn 

195 200 205 

Asn Met Val Gly Glu Arg Val Ser Asn Pro Ala Arg Leu Ser Val Phe 

210 215 220 

Glu Lys Pro Lys Phe Glu Gin Glu Pro Lys Asp Met Thr Val Asp Val 
225 230 235 240 

Gly Ala Ala Val Leu Phe Asp Cys Arg Val Thr Gly Asp Pro Gin Pro 

245 250 255 

Gin He Thr Trp Lys Arg Lys Asn Glu Pro Met Pro Val Thr Arg Ala 

260 265 270 

Tyr He Ala Lys Asp Asn Arg Gly Leu Arg He Glu Arg Val Gin Pro 

275 280 285 

Ser Asp Glu Gly Glu Tyr Val Cys Tyr Ala Arg Asn Pro Ala Gly Thr 

290 295 300 

Leu Glu Ala Ser Ala His Leu Arg Val Gin Ala Pro Pro Ser Phe Gin 
305 310 315 320 

Thr Lys Pro Ala Asp Gin Ser Val Pro Ala Gly Gly Thr Ala Thr Phe 

325 330 335 

Glu Cys Thr Leu Val Gly Gin Pro Ser Pro Ala Tyr Phe Trp Ser Lys 

340 345 350 

Glu Gly Gin Gin Asp Leu Leu Phe Pro Ser Tyr Val Ser Ala Asp Gly 

355 360 365 

Arg Thr Lys Val Ser Pro Thr Gly Thr Leu Thr He Glu Glu Val Arg 

370 375 380 

Gin Val Asp Glu Gly Ala Tyr Val Cys Ala Gly Met Asn Ser Ala Gly 
385 390 395 400 

Ser Ser Leu Ser Lys Ala Ala Leu Lys Val Thr Thr Lys Ala Val Thr 

405 410 415 

Gly Asn Thr Pro Ala Lys Pro Pro Pro Thr He Glu His Gly His Gin 

420 425 430 

Asn Gin Thr Leu Met Val Gly Ser Ser Ala He Leu Pro Cys Gin Ala 

435 440 445 

Ser Gly Lys Pro Thr Pro Gly He Ser Trp Leu Arg Asp Gly Leu Pro 

450 455 460 

He Asp He Thr Asp Ser Arg He Ser Gin His Ser Thr Gly Ser Leu 
465 470 475 480 

His lie Ala Asp Leu Lys Lys Pro Asp Thr Gly Val Tyr Thr Cys He 

485 490 495 

Ala Lys Asn Glu Asp Gly Glu Ser Thr Trp Ser Ala Ser Leu Thr Val 

500 505 510 ' 

Glu Asp His Thr Ser Asn Ala Gin Phe Val Arg Met Pro Asp Pro Ser 

515 520 525 

Asn Phe Pro Ser Ser Pro Thr Gin Pro He He Val Asn Val Thr Asp 

530 535 540 

Thr Glu Val Glu Leu His Trp Asn Ala Pro Ser Thr Ser Gly Ala Gly 
545 550 555 560 

Pro He Thr Gly Tyr He lie Gin Tyr Tyr Ser Pro Asp Leu Gly Gin 

565 570 575 

Thr Trp Phe Asn He Pro Asp Tyr Val Ala Ser Thr Glu Tyr Arg He 

580 585 590 

Lys Gly Leu Lys Pro Ser His Ser Tyr Met Phe Val He Arg Ala Glu 

595 600 605 

Asn Glu Lys Gly He Gly Thr Pro Ser Val Ser Ser Ala Leu Val Thr 
.610 615 620 


Thr Ser Lys Pro Ala Ala Gin Val Ala Leu Ser Asp Lys Asn Lys Met 
625 630 635 640 

Asp Met Ala lie Ala Glu Lys Arg Leu Thr Ser Glu Gin Leu lie Lys 

645 650 655 

Leu Glu Glu Val Lys Thr lie Asn Ser Thr Ala Val Arg Leu Phe Trp 

660 665 670 

Lys Lys Arg Lys Leu Glu Glu Leu He Asp Gly Tyr Tyr He Lys Trp 

675 680 685 

Arg Gly Pro Pro Arg Thr Asn Asp Asn Gin Tyr Val Asn Val Thr Ser 

690 695 700 

Pro Ser Thr Glu Asn Tyr Val Val Ser Asn Leu Met Pro Phe Thr Asn 
705 710 715 720 

Tyr Glu Phe Phe Val He Pro Tyr His Ser Gly Val His Ser He His 

725 730 735 

Gly Ala Pro Ser Asn Ser Met Asp Val Leu Thr Ala Glu Ala Pro Pro 

740 745 750 

Ser Leu Pro Pro Glu Asp Val Arg He Arg Met Leu Asn Leu Thr Thr 

755 760 765 

Leu Arg He Ser Trp Lys Ala Pro Lys Ala Asp Gly He Asn Gly He 

770 775 780 

Leu Lys Gly Phe Gin He Val lie Val Gly Gin Ala Pro Asn Asn Asn 
785 790 795 800 

Arg Asn He Thr Thr Asn Glu Arg Ala Ala Ser Val Thr Leu Phe His 

805 810 815 

Leu Val Thr Gly Met Thr Tyr Lys He Arg Val Ala Ala Arg Ser Asn 

820 825 830 

Gly Gly Val Gly Val Ser His Gly Thr Ser Glu Val He Met Asn Gin 

835 840 845 

Asp Thr Leu Glu Lys His Leu Ala Ala Gin Gin Glu Asn Glu Ser Phe 

850 855 860 

Leu Tyr Gly Leu He Asn Lys Ser His Val Pro Val He Val He Val 
865 870 875 880 

Ala He Leu He He Phe Val Val He He He Ala Tyr Cys Tyr Trp 

885 890 895 

Arg Asn Ser Arg Asn Ser Asp Gly Lys Asp Arg Ser Phe 
900 905 


